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DRUG METABOLIZING ENZYMES 
TECHNICAL FIELD 

This invention lelates to nucleic add and amino add sequences of drug metabolizing enzymes 
5 and to Ihe use of diese sequences in die diagnosis, treatment, and prevention of 

antortmniineAnflamniatoiy, cell piolifi^tive, developmoital, endociine, eye, metabolic, and 
gasHointestinal disoidm, inchding liver disorders, and in the assessment of die ejects of exogenous 
compounds on die enpression of micldc add and amino acid sequences of dmg nietabofizing ^izymes. 

10 BACKGROUND OF THE INVENTION 

The metabolism of a dmg and its movi^ent duou^ die body (phannacokinedcs) are 
inqiortaut in det^mining its efiiects, toxicity, and interactions widi other drugs. The three processes 
governing pharmacoldnjetics are d» absotption of the drug, disttibution to various tissaes, and 
elimination of drug metabolites, lliese processes ate intimately coupled to drug metabolism, since a 

15 variety of metabolic noodifications aher most of die physicodiemical and pharmacological properties of 
drugs, inchiding solubilhy, binding to receptors, and exacretion rates. The metabolic padxways wbidi 
modify drug;5 also accept a variety of natural^ occurting substrates sudi as st^ids, fatty acids, 
prostaglandins, leukotri^ics, and vitatDins. The enzymes in these pathways are therefore inq)ortant 
sites of biodiemical and pharmacological interaction between natural compounds, drugs, carcinogens, 

20 matagens, and xenobiotics. 

It has long been ^^reciated that inheiited difierences in dmg metabolism lead to drastically 
differ^levelsof dmgefibacy andtoxidty ainongindhddua^^ For drugs with narrow dierapeutic 
indices, or dmg3 whidi require bioactrvation (sudi as codeine), these polymorphisms can be onticaL 
Moreover, promising new dmgs are frequent^ eliminated in clinical trials based on toxicities which 

25 may only affect a segm^ of die patient group. Advances in {^larmacogenomics researdi, of which 
dmg metaboHztng ^izymes constttute an important part, are promising to expand the tools and 
information that can be bmught to bear on questions of drag efficacy and toxicity (See Evans, W. E. 
and R. V. Relling (1999) Sdeocc 286:487-491). 

Drug metabolic reactions are categorized as Phase I, which functionalize the dmg molecule 

30 and prepare it for fiurthc^metabolisin, and F In general, fhase I 

reaction products are partially or fully inactive, and Fhase II reaction products are the dbisi exoteted 
spedes. HDwevia:,Iliase I reaction product are sometimes inore active dian die oti^^ 
administered drog^; this soetsibolic activation principle is eaqpiotied by pro-drogs (eg. I^dopa). 
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Additionally, some nontoxic compoxmds (e.g. aflatoxiQ, benzo[a]pyiene) ate metabolized to toxic 
intermediates through these padiways. Fliase I reactions are usually rate-limiting in dmg metabolism. 
Prior exposure to the compound^ or other coixqxnijQds, can induce die expression of I^iase I enzymes 
however, and thereby increase substrate flux dirough die metabolic pathways. (See Klaassen, C. D., 

5 Anadur, M. Q. and J. Donll (1996) Casarett and DouFs Toxicology: The Basic Science of Poisons, 
McGraw-Hill, New York, NY, pp, 113-186; B. G. Katzong (1995) Basic and Clinical Phatmacolopv , 
Appleton and Lange, Norwallc, CT, pp. 4S-S9; 6. G. (Sbson and P. Skett (1994) Introduction to Dmie 
Metabolism, Blackie Academic and Professional, London.) 

DnigxnctaboHabig engines (DMEs) haw broad This can be 

10 contrasted to the immmift system, wh^ a large and divide population of antibodies are hi^y 

specific for dieir antigens. The abifity of DMEs to nietabolize a wide variety of niolecules creates the 
pot^itial ibr drag interactions at the level of metabolism. Bor example, the induction of a DME by one 
compound may afifect the metabolism of another compound by the enzyme. 

DMEs have been classified according to the type of reaction they catalyze and the co&ctors 

15 involved The major classes ofFbase I enzymes iiK^de^ but are n^^ 

flavinrcontainingmonoos^g^tiase. Oflii^ enzynoe classes involved in Iliasel-typecataJ^cc^^ 
reactions include, but are not limtted to, NADFH cytodirome P450 reductase (CPR), die microsomal 
cytochrome b5/NADH cytodmime b5 reductase system, the ferredoxin/fenedo^dn reductase redox 
pair, d^do/keto reductases, aiui alcohol dehydrogenases. The major classes of Phase n enzymes 

20 include, but are not limited to, HDP ghicuronylttansferase, sulfotrans^rase, glotaduone S-transfraase, 
N-acyltransierase, and N-acet^ transferase. 
Cytochrome F450 and P450 catalytic cvcle-assodated enzymes 

Members of the cytochrome P450 sup^^anuly of enzymes catalyze the oxidative metabolism 
of a variety of substrates, including natural compounds such as st^ids, fatty adds, prostaglandins, 

25 leukotrienes, and vitamins, as weH as drags, cardnogens, niutagens, and xenobiotics. Cytochromes 
P450, also known as P450 hemcr-diiolate proteins, xisually act as temiinal oxidases in muM-con^m^ 
ekctron transfer chains, caDedP45()-cQntainingnK>nooxygenasesyst Specific reactions 
catalyzed include hydro^lation, ^loxidation, N-^xidation, sulfooxidation, N-, S-, and 0~deaIkylations, 
desulfation, deamination, and reduction of azo, nitro, and N-oxide groups. These reactions are 

30 involved in steroidogenesis of glucocorticoids, Cortisols, estrogens, and androgens in animals; 

insectidde resistance in insects; herbidde resistance and flower coloring in plants; and cnviionm^ital 
biormnediation by microorganisms. Cytoc^iron^ P450 actions on drag;s, cardnogens, mutagens, and 
xetu)biotics can result in detoxification or in conversion of the substance to a more toxic product 
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Cytoduomes P450 arc abundant in the Bver, but also occur in other tissues; the enzymes are located 
in miciosonies. (See ExPASY ENZYME EC 1.14.14.1; Prosite PDOC00081 Qtochrome P450 
cysteine b^e-iron ligand signature; PRINTS EP450I E-Qass P450 Group I signature; Graiham- 
Loience, S. and Peterson, LA. (1996) FASBB J. 10:206-214.) 

5 Four hundred cytochromes P450 have been id^tified in diverse organisms including bacteria, 

fimgi, plants, and animals (Graham-Lorence, sugra). The B-class is found in prokaiyotcs and fungi, 
while the E-class is found in bactma, plants, insects, v^tebrates, and mammals. Hve subclasses or 
groups are found within flie laigea: fiamily of E-class cytochromes P450 (IMNTS EP450I BOass 
P450 Group I signature). 

10 AncytodMrornesP450useabenD©cofktoranddiarcstr^ Most 

cytochromes P450 are 400 to 530 amino adds in lengflL The secondary stracturc of flieeaizyme is 
gOwut 70% a^ha4ielicd and ifl)out 22% beta-sheet The legion around the heime-bhidingdtem the 
teminal part of Aeprotdn is conserved among cytochromes P450. A ten amino add signature 
sequence m tins hcane^iron ligand icgian has be«i identified wHdi includes a cons^ved cysteine 

15 mvolvedinbindSbg1hehesne]n>ninthefifQicooidjn hieukaryottccytodiromesP450»a 
nn^rane-spanmng re^on is usually found in die first 15-20 amino adds of the protdn, genefraOy 
consisting of ^proxunately 15 hydrophobic residues followed by a positively charged residue. (See 
Pro^ PD(XX)0081, supra : Graham-Loreaice, supra .) 

CytodnoniB P450 enzymes are involved in cell proliferation and development The enzymes 

20 have roks m diemical mutagenesis and cardnog^nesis by metabolizmg diemicals to reactive 
intnmediates ^ form a^cts with DNA G^ebert» D.W. and Gonzalez, F J. (19S7) Ann Rev. 
Biodifm 56:945-993). Tbese adducts can cause nucleotide dianges and DNA rearrangements lhat 
lead to oncogenesis. CyiodiromcP450caqpK5Ssioninliveramdodi^^ 
sndi as po^cycHc aromatic hydrocarbons, pEai>xisomal prolifbrators, pthenobarhital, and the 

25 gkicocorticoid dexamediasone (Dogra, S.C. et al (1998) Oul Exp. Pharmacol Physiol 25:1-9). A 
cytodunme P450 protein may participate in eye development as mutations in the P450 g^e CYPIBI 
cause primary congenital glaucoma (Onlii^ Mendelian Inhentance in Man (OMIM) *601771 
C^dnome P450, subfamity I (dioxin-inducible), polypeptide 1; CYPIBI). 

Cytoduomes P450 are associated with inflammation and infection. Hepatic cytodirome P450 

30 activities are profoui^fy ajQfected by various infections and inflammatory stimuli, some of wbidi are 
suppressed and some mduced (Morgan, RT. (1997) Drug Metab. Rev. 29:1 129-1188). Effects 
observed in vivo can be mimiclxd by proinflanunatory c^Knes and intoferons. Autoantibodies to 
two cytoduoiue P450 ptotdns were found in patients widi autoirmnxme polyenodoc^ 
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candidiasis-ectodearmal dystrophy (APECED), a polygtoidular autoimmmie syndrome (OMIM 
*240300 AtttoimriaiBc polyenodoorinopathy-K^andidiasis-e^dermal dystrophy). 

Mutations in cytochromes P450 have been linked to metabolic disorders, including congenital 
adrenal hyperplasia, the most conomon adrenal disorder of infancy and childhood; pseudovitamin D- 
5 defid^cy rickets; cerebrotendinonsxanthornatosis, a fipid storage disease char^^ 

progtessive neurologic dysfunction, prcmatoie atherosclerosis, and cataracts; and an inherited 
resistance to the anticoagulant drags coumarin and warfarin (Isselbacher, K. J. et aL (1994) Harrison's 
Principles of Internal Medicine, McGraw-IBIl, Inc. New York, NY, pp. 1968-1970; Takeyama, EL et 
aL (1997) Science 277:1827-1830; Kitanaka, S. et aL (1998) N. EngL J. Med. 338:653-661; OMIM 
10 *2137OOG»d>rotciidin0nsxaiijdiomatosis; and OMIM #122700 Extremely 
high levels of expression of die qytodhrome P450 protdn aromatase wore found in a fibrolamellar 
hepatoceOnlar cardnoma ftom aboy vn&k sevMe gynecomastia (feminization) (Ag^rwal, V.R- (1998) 
J. dUL EndocdrmoL Metab. 83:1797-1800). 

Tlie cytodirome P450 catal^c cycl& is con^lBted through redaction of cytochrome P4S0 by 
15 NADEBcytDdiroineP450 reductase (CTR). Another microsomal clcctnm transport systeni 
consisting of cytodiromeb5 andNADFHcytDchroniebS redactasehasbeen wide^fyvbwed as a 
. nainorconlributor of electrons to the cytodiroineP450cataly^^ However, a recent report by 
Lamb, D. C et aL (1999; FBBS Lett 462*283-8) identifies a Candida albicans cytodirome P450 
■ (CYKl) which can be effidenHy reduce^ and supported by the rmcrosomal cytodirome b5/NADPH 
20 . cytodirome bS reductase sj^st^n. Theiefore,fliereareliketyiiianycytodiromesP450 whidiare • • 
sui^rted by this a]texiiati[ve elsctron donor syst^ 

Q^tDdurome b5 reductase is also responsible for &e reduction of oxidized hemoglobin 
(metiiemogiobin, or ferrihemoglobin, which is unable to carry oxygen) to die active hemoglobin 
(ferrohemogiDbia) in red blood cells. Mediemogbbinenda results when there is a hi^ level of oxidant 
25 drugs or an abnormal hMnoglobin (hemoglobin M) whidi is not efficiently reduced. 

Metiiemoglobin^nia can also result firom a bereditary defidency in red cell cytochrome b5 reductase 
(Reviewed in Mansonr, A. and Lurie, A. A. (1993) Am. J. Hematol 42:7-12). 

Members of die cytodhtome P450 famitf are also closely assodated with vitamin D synthesis 
and catabolism. A^tamin D exists as two biolo^calLy equivaleait prohormones, ergocaldferol (vitamin 
30 D2), produced in plant tissues, and diokcaldferol (vitamin D3), produced in animal tissues. The latter 
form, diolecalcif^l, is formed npon flie e:qK>sure of 7-dehydrodio]esterol to near ultraviolet light (Le., 
290-3 10 nm), normal]^ resulting fnmi even Tninimal periods of sldn exposure to sunli^ (reviewed in 
Milter, WJL and Portale, AJL QOMX) Treuds EndocthioL Metab. 11315-319). 
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Bofli prohormone fomis are fbrlii^ metabolized in the liver to 25-liydT03^tainin D 
(25(OH)D) by the enzyme 25-hydjro3cylas6. 25(OH)D is the most abmidant precuisor form of vitamin 
D which must be fiirfher metabolized in flie Iddney to the active form, lc^25-dihydroxyvitamin D 
(la,25(OH)2D), by the enzyme 25-hydroxyvitamin D lorhydroxylase (la-hydroxylase). Regulation of 

5 la^(OE0i^ production is prirnarify at this final stiq> in the syt^ The activity of 

la-hydroxylase depends upon several jihysiolog^cal factors including die circulating level of the 
enzyme product (la^(OIQ^) and the levels of paratiiycoid homone (PTEQ, calcitonin, insulm, 
caldnmy phosphorus, gcowthhormoney and prolactin. Pdrfh^rmore, extrarenal la-hydrozylase activity 
has be^u reported, suggesting tiiat tissue-spedfic, local regulation of la,25(OH)2D production may 

10 also be biolo^calty important Ihecatalysisof lo^(OH)2D to24^-dihydbx^ 

(24,25(OH)2D), involving tibe enzyme 23-hydrDxyvitaniin D 24*hydtQxylase (24-hydroxylase), also 
occurs m the kidney. 2d4iydTDicylase can also use 25(QmD as a substrate (ShinkL T. et al (1997) 
Proc. NatL Acad. ScL U.S.A. 94:12920-12925; Miller, W.L and Portafc, AA. supra : and references 
within). 

15 Af^tanoin D 25-hydrox3dase, la-hydror^ase, and ZMiydroxylase are alt NADPH-depend^ 

typel (mitochaDdxia]) eytDditomeP450 enzyniBS that show afai^ d^^ of homology with oth^ 
membm of die family. Vitamin D 2S-hydrosylase also shows a broad substrate specificity and may 
also petfonn 2&bydn>xylation of bile acid intermediates and 25, 26, and 27-hydro^lation of 
dholestcrol (Dflwordi, FJ. et aL C1995) L Biol. Cbem. 270:16766-16774; MiDo:, W.L. and Portale, 

20 A.A supra ; and references widiin). 

The active form of vitamin D (la,25(OH)2D> is involved in caldum and phosphate 
homeostasis and promotes die difier^liation of myxoid and skin cells. Vitamin D defidenc^ resulting 
from defidendies in the ^izymes involved in vitamin D metabolism (e.g,, la-hydroxylase) causes 
hypocalcemia, hypophosphatemia, and vitamin D-de^ndent (sensitive) rickets, a disease diaracterized 

25 by loss ofbonedendty and dSstmctive clinical features, including bandy or bow legged 

acconqxanied by a waddling gait Defici^cies in vitamin D 25-hydroxylase cause cerebrot^dinous 
xandiomatosis, a lipid-storag^ disease diaracterized by die deposition of dioleste^l and diokstanol in 
die Adiilles' tendons, brain, lungs, and many other tissues. The disease presents widi progressive 
neurologic dysfonction, including postpubescent cerebellar ataxia, atherosderosis, and cataracts. 

30 Vitamin D 25-hydroxylase defidency does not result in rickets, suggesting the existence of alternative 
padiways for the synflsesis of 25(OH)D (Griffin, J.E. and Zerwekh, J.E. (1983) L Cain^ Invest 
72:1190-1199; GambBn, aX. et aL (1985) J. Clin. Invest 75:954-960; and WI^ and Portale, AJ^ 
supra) . 
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Ferredoxiii and ferredoxin reductase are electron transport accessory protdms which support 
at least one human cytochrome P450 species, cytochrome P450c27 encoded by the CYP27 gene 
(Dilworth, F. J. et al (1996) Biochem. J. 320:267-71). A Streptomvces griseus cytochrome P450, 
CYP104D1, was heterologonsly eiqjressed in E. coli and fofmd to he reduced hy flie endogpnous 

5 fencdo:3dn and fenedoxin reductase enzymes (Taylor, M. et al (1999) Biodiem. Biophys. Res. 
Conmnin. 263:838-42), suggesting that many cytochrome P450 spedes may be supported by flie 
fenedoxin/fbrredoxtn lednctase pair. Feiredozin reductase has also hcesx found in a model drug 
metabolism system to reduce actinomycin D, an antitumor antibiotic, to a reactive free radical spedes 
(Flitter, W. D. and Mason, R- P. (1988) ArdL BiodienL Biophys. 267:632-9). 

10 Flavin-con tafninp tno nooxveenase (FMO^ 

Fkvin-oontaining monooz^pnases oxidize Ifae nucleophiSc nitrog^ sulfur, and phosphorus 
hetmatomof an ezc^onal range of substrates. Like c^rtod]romesP450,FMC)5 are rmcxosomal 
and nse NADPH and Oj; th^ is also a great deal of substrate overly widi cytochromes P450. The 
tissue distribution of FMC)s includes liver» kidney, and hing. 

15 . Tbae are five different known isofbrms of FMO m mammals (PMOl, FM02, FM03, FM04, 

and FMOS),whi(^aieeaqpressedinatissae"Specj£ionDUi^^ The isofbrms di^mlheirsubstr^ 
spedfidties and o&er properties such as inhibition by various cono^ounds and st^:eospedficily of 
readioiL EMOs have a 13 amino add signature sequence, flie components of whidi span &b N- 
terminal two-thirds of the sequences and include the FAD binding region and &e FATGY motif which 

20 has been ibnnd in many N-hydroxylating razymes (Stehr, M, et aL (1998) Trends BiochenL ScL 
23:56-57; PRINTS FMOXYGENASE Flavin-contaming monooxygraiase signature). 

Spedfic reactions include oxidation of nucleophOic tertiary amines to N-oxides, secondary 
amines to hydroi^amines and nitrones, primary amines to hydro:iQrlanm^ and oximes, and sulfhr- 
containiDg compounds and {^osphines to S- and P-oxides. Hydrazines, iodides, selenides, and boron- 

25 containing compounds are also substrates. Althongh FMCfe ^PP^ similar to cytochromes P450 in 
flieir chj^Qoistry, ftiey can g^i^aHy be distinguished from cytodiromes P450 in vitro based on^ for 
exan^lc, die K^ij^ heat lability of FMC)s and the nomonic detCTgc«t sei^ 
however, use of diese properties in id^tification is compficated by forlher variation among FMO 
isoforms wi& respect to thermal stability and det^gent sensitivity. 

30 FMOs play important roles in the metabolism of several drags and xeaiobiotics. FMO (FM03 

in hver) is predominant^ responsible for metabolizing (S)-nicotine to (S)-nicotine N-l'-oxidc* whicOi is 
excreted in urine. FMO is also involved in S-osygpnadon of dmetidine, an H^-antagpnist widely used 
forfilet^Bal^Katltofg^stric1lkeIs. Liver-expressed forms of FMO are not imderfliesarDe regulate 
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control as cytodiroine P450. In rats, for example, phenobafbhal treatment leads to the indaction of 
cytochrome P450, but the repression of PMOl. 

Endogenous substrates of FMO include cysteamine, which is oxidized to the disulfide, 
cystamine, and trimefhylamine (TMA), whidi is metabolized to trimethylamine N-oxide. TMA smells 
5 like rotting fish, and mutations in the EM03 isoform lead to large amounts of the malodorous free 
amine being excreted in sweat, urine, and breath. These symptoms have led to the designation fish^ 
odor syndrome (OMIM 602079 TrimefhylamhHiria). 
Lysyi oxidase : 

Lysyl oxidase (lysine 6-oxidase» LO) is a copper-dep^dent anune oxidase involved in the 

10 foiBiaiion of connectii^ tissue matrices by <m>ssliiik^^ LOissecaietedasaN- 
glycos]^ted precnior protdn of aj^iroxfaiiately SO kDa Jjevels and cleaved to the mature form of tiie 
en^me by a metaDoptoteasey alfiLong^ precoisor form is also active. The copper atom in LO is 
involved in the transport of electron to and firom oxygen to fadlitate the oxklative deanoination of ^ine 
lesiduesin&eseextracellalarmfEtrixprotBins. Whifediccoordinationofcoj^ris essential to 1X3 

15 activi^, insnjEScioDt dietary intake of copper does not inflnence the expression of the apoenzyme. 
Howevi^, the abs^ce of tibe fimcdonal LO is Imked to the skeletal and vascular tissue disord^ that 
. aieassodated wiaidietatycoppea>deficieiicy. LO is also ihhibitedby a variety of semicarbasddes, 
hydrazines* and amino nitrites, as well as heparin. Beta-aminopocopionitrile is a commonly used 
inhibitor. LO activity is increased in response to ozone, cadmium, and elevated levels of hormones 

20 released in response to local tissue trauma, such as transforming growth f actor-4)eta, platelet-derived 
growtiifiu:tor,ang^otsDsinlI,andfiAmU AhnormaBties in LO activity has been 

Uniked to MeadoBS syndrome and ocdtpitall^ Cytosolic forms of the enzyme hae been, 

impficated in abnojonal ccD proliferation (reviewed in Rncfca-, R-B. et aL (1998) Am. J. Clin. Nutr. 
67:996S-1002S and SmiflKMongp. Li and Kagan, RM. (1998) Matrix BioL 16:387-398). 

25 Dihvdrofplate reductases 

Whydrofolate reductases (DHFR) are ubiquitous enzymes that catalyze the 
NADPH-depeaident reduction of dihydrofblate to tetrahydrofblate, an cssootial step in the denovo 
synthesis of glycine aiMl porines as well as the conveision of deoxyuridine monophosphate (dUMP) to 
deoxylhymidine monophosphate (dTMP). The basic reaction is as follows: 

30 

7,8^ydrofolate + NADPH 5,6,7,8-tetrahydrofolatB + NADP 
The eazymss can be inhibited by a number of dihydrofolate analogs, including trimethroprim and 
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mediottexate. Since an abundance of TMP is reqiiiicd for DNA synthesis, rapidly dividing cells 
require fhc activily of DHFR. The replication of DNA yiruses (Le., herpesvirus) also requires high 
levels of DHFR activity. As a result, drags that target DHFR haVe been used for cancer 
c^iemotherapy and to inhibit DNA virus replication. (For dmilar reasons, thymidylate synthetases are 

5 also target enzymes.) Drags that inihibtt DHFR ate preferentiaDy cytotoxic for rapidly dividing cells 
(or DNA vims-infected cells) but ha^ no specificity^ resulting in the indiscriminate destraclion of 
dividing cells. FiuthmiiDre^ canc^ ceDs may become resistant to drags sudi as methotrexate as a 
resnltof acqpiired transport defects or Ibe duplication of one or more DHFR g!»]e^ (1988) 
Biochemistry . W.H Freeman and Co., Inc. New Yodc pp. 511-5619). 

10 Aldo/keto reductases 

Aldo/kacto reductases are monomeric NADFH-depeudcnt oxidoreductases widibroad 
substrate spedficides (Bobien, K. M. et al (1989) J. BioL Chem. 264:9547-51). Tbese enzymes 
cata^ze liie redoctioiL of catbonyl-containing compounds, including carbonyl-containing sugars and 
aromatic conc^unds, to fte corresponding alcohols. Therefore, a variety of carbonyl-containing drags 

15 and zjcnobiolics are likely metaboli2sed by ^izymes of 

One known reaction catalyzed by a family memb^, aldose reductase, is the reduction of 
ghicose to soibito],wliic^ is then forth^ metabolized to Under 
normaL conditions, ttie reduction of ghicose to sorbitol is a minor pathway. In hyperglycemic states, 
however, die accumulation of sorbitol is implicated in the development of diabetic conq>lications 

20 (OMM'^IOSSSO AMo-keto reductase &iDilyl,mea])berBl)i Members of this ^izyme family are 
also highly expressed in some Bvear cancels (Cao, D. et aL (1998) J. BioL Chem. 273:11429-35). 
Alcohol dehydrogenases 

Alcohol ddhydrogenases (ADBs) oxidize simple aicohok to tihe cdrrespondhig ald^ydes. 
ADH is a cytosoHcenzyme^ prefix the cofactorNAD^, and also binds zinc ior^ liver contains the 

25 highest levels of ADH, with lower levels in kidney, hmg, and the gastric mucosa. 

Ejiown ADH isoforms are dimedc proteins conoposed of 40 kDa subunits. There are five 
known gene loci which encode diese subtmits (a, b, g, p, c), and some of the loci have charact^ized 
aOelic variants (bj, bj, bg, g^, gj). The subumts can form homodimers and heterodimers; (he snbunit 
composition determines the spedfic properties of the active enzyme. The holoenzymes have therefore 

30 been categprized as Class I (subnnit compositk)ns aa, ab, ag, bg, gg). Class II (pp), and Class III (cc). 
Class I ADH isozymes oxidize edianol and other small a}i]]diatic alcohols, and are inhibited by pyrazole. 
Class n isozymes prej^ longer chain aliphatic and aromatic alcohols, are imahle to oxidize methanol, 
and are not inhibited by pyrazole. Class IH isozymes prefl»^ even longer chain aliphatic alcohols (five 
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carbons aad longer) and aromadc akohols, and aie 

The short-duim alcohol dehydrog^iases include a munb^ of lelated enzymes with a variety 
of substrate spedfidlies. Included in fliis group aie the nnaniTnaliflii enzymes D-beta-hydro::!!ybutyrate 
ddnydrogenase, (R>-3-hydio^Tityrate ddiydiog^nase, 15-hydiox^iostagiandin dehydrogenase, 
5 NADFH-d^ndent catbonyl ledactase^ corticosteroid 11-beta-dehydrogenase, and estradiol 17-beta- 
dehydtogenase, as well as die hact»ial enzymes acetoacetyl-CoA reductase, glucose 1- 
ddiydrogjenase, 3-beta-hydro^steioid ddiydrog^ase, 20-beta-hydro:Eysteroid ddbydrog^nase, nbitol 
dehydrogenase, 3-^xoacyl reductase, 23-4ihydro-23-dihydrosyb^izoate dehydrogaiase, sorbjtol-6- 
phos[diate 2-^^ydrogpnase» 7-aIpha-hydro^t^id d^ydtogonase, ds-l^-^ydroxy-3,4- 

10 cyclohexadiene-l-carboxylate ddiydro^oase, ciJhtohicm> dihydrodiol dchydrogeaiase, c^^-beaizene 
glyool ddiydrogenase» biphcny]-23-^ydio-23-diol dehydrogenase^ N-acyhnannosamine 1- 
ddiydxogenase, and 2-deoxy>D-ghiconate 3-ddiydtog«iase CSlrozowsld, Z. (1994) J. Steroid 
Biochem. MoL Biol 51:125-130; Ktozowski, Z. (1992) Mol Cell BndocrinoL 84£25-31; and Marks, 
AJL et al (1992) J. Biol Chean. 267:15459-15463). 

15 TJDP ghicuronvitransfi^asc 

M^iib^ of die UDP ^curonyltransferase fsaxdSiy (UGTs> catalyze the transfer of a 
. ghicuronk. add grotqp from die cefaclor uridine dqphosphate-glucuronic add (UDP-^curonic acid) to 
:a substrate. The transfer is general!^ to a imcleophi]icheteroatom(0»N, or S). Substrates include 
xenobiotics whidi harve been functionalized by Phase I reactions, as well as endogenous conqiounds 

20 such as bilirubin, steroid hormones, and thyroid hormones. Products of ghicuronidation are excreted in 
urineif the molecular of the substrate is less dian about 250 g/molyWhi^^ t 
gincnronidated substrates are excreted tu bile. 

UGTs are located in file microsomes of fiver, kidney, intestine, skin, brain, spken, and nasal 
mucosa, whore diey are on die same side of the endoplasmic reticulum membrane as cytochrome 

25 P450 enzymes and fiavin-contalning monooxygenases, and dierefore are ideally located to access 
products of Phase I drag metabolism. UGTs have a Oterminal membrane-spanning domain whidi 
anchors in the endoplasmk: reticulum noi^nibrane, and a conserved signature domain of about 50 
amino add residues in diear C tenmnai section (Prosite PDOC00359 UDP-glycosyltransferase 
signature). 

30 UGTs involved in drug metabofism are aicoded by two gene femilies, UGTl and UGT2. 

MCTbers of die UCjTI family result fnmi 

variable substrate bindii^ domidn and constant legbn involved in cofactor binding and niCTbiane 
ins^tion. Meaiibers of flieUGTZ femiiy are e3K»ded by separate gjene lod, and are divided into two 
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families, UGT2A and UGT2fi. The 2A subfamily is expiessed in olfactory cpithefium, and the 2B 
subfamil/ is e:xptessed in fiyer xmcrosonies. Mutations in UGT g^ies are associatied with 
hyp^ilirubineima (OMIM #143500 Hyp^biliriibinemia I); Crigler-Najjar syndrome, characterized by 
intexisehypeibilinibineniia from birth (OMIM #218800 Crigl^-Najjar syndrome); and a milder form of 
S hypeibilinibinemia termed raibeat's disease (OMM 
Sulfotrap Rferasft 

Sulfate conjugation occors on many of flie same substrates which undergo O-gtacuronidation 
to produce a IdgUy water-soluble snifoiic acid est^. Salfotransferases (ST) catalyze ttiis reaction by 
transferring SOi from the cofector 3 -piiospiioad^i0sine^5'-piios{)lLosa]fate (PAPS) to the substrate. 

10 ST substrates are predominantly phenols and aUfdiatic alcohols, but also include aromatic annnes and 
aliphatb amines, which areconjugated to produce the corresponding sulfamates. The products of 
tbsse reactions are excreted niainly in nrme 

STs are fiound in a wide rangp of tissues, including hver, Mdney, intestinal tract, long, platelets, 
and brain. The enzymes are g^nerany<rjrtosolic, and nxult^le form For 

15 exanrplB^therearernorethanadozenfiamisof STinratlivc^rc^tosoL These biochemically 
diaracterized STs M into five classes based on tfidbr substrate preference: arylsuftfbtransjferase, 
alcohol su]jfbtransferase» estrogen soUbtransf&rase, tyrodne ester sulfotransf erase, and bile salt 
solfotransfBrase. 

ST enzyme activity varies greatly with sex and age in rats. The comibined effects of 
20 developmental cues and sex-related hormones are thought to lead to these differences in ST 

expression profiles, as well as the profiler of other DMEs such as cytochromes P450. Notably, the 
faig^ eapressioiL of STs in cats partially compensates for their low level of UDP giacuronyltransf»ase 
activity. 

Several forms of ST have been purified fivm human liver cytosol and cloned. There are two 
2S {d}eiu>lsulfotransfta:ases with differ^t thermal sta^ The thermostable 

^izyme cata^zes ttie sulfiation of i^henols such as para-nitrophenol, minoxidil, and acetaminophen; the 
diermo]abil& ^izyme pref^ monoamine substrates sudh as dopamine, epine]dirine, and levadopa. 
Other cloned STs include an estrogen sulfotransferase and an N-acetylglucosaraine-6-O- 
snlfotraiisf^rase. This last raizyme is illustrative of the other major role of STs in cellular biochemistry, 
30 the modification of carbohydrate stmctnres that may be im5)ortant in ceBular differentiation and 

maturation of proteoglycans. Indeed, an inherited defect in a sulfotransfbrase has be^ in^licated in 
macular corneal dystrophy, a disorder dtaracterized by a failure to synflicsizD mature keiatan sulfate 
proteoglycans (Nakazawa, K. ^ aL (1984) J. Biol Chem. 259:13751-7; OMIM *217800 Macular 
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dystrophy, corneal). 
GalactDsvltransfeniscs 

Galactosyltransfeimes aie a subset of giycosyllraiisferases fhat transfer galactose (Gal) to 
fhe terminal N-a(xtj^liicosaii]uie (GkiNAc) oligosaccharide dbains that aie part of glycoproteins or 

5 glycolipids that are fiee in solution (Kolbinger, F. et aL (1998) L Bk>l Cbem, 273:433-440; Amado, M. 
et al (1999) Biocbim. Biophys. Acta 1473 35-53). Galactosyltransferascs have been detected on the 
ceE surface and as soluble extracelhilarptoteins^m P13* 
galactosyltransferasesfonniypelcaibohydrat&diainswilbGi^ Known 
famnan and mouse pl3-gnlactosy]ttansferases appear to have a short cytosofic domffln, a single 

10 transmembrane domain, and a catalytic doinain wifli 6^ (Kolbing^, F. supra and 

Hbnnet, T. et aL (1998) J. BioL Chcm 273:58-65). In mouse lIDP-galactDse:P-N-ace^glucosamine 
P13-galactosy]transfbrase-I region 1 is located at amino add residues 78-83, region 2 is located at 
amino add residues 93-102, region 3 is located at amino add residues 116-119, region 4 is located at 
amino add residues 147-158, region 5 is located at amino add residues 172-183, region 6 is located at 

15 atxuno add residues 203-206, re^on 7 is located at amino add residues 236-246, and region 8 is 
located at amino add residues 264-275. A variant of a sequeiK:e found wi&tnmonseUDP- 
galactose:p-N-ac^]glucosanune pl3-galactosyltransferase-I region 8 is also found inbactmal 
galactosyltransferases, su^esting fliat Uns sequence defines a galactosylttansferase sequence motif 
(BCennet, T. stt|ffa). Rbc^ work suggests thatbraimac protein is a pl3-galactosyl^^ 

20 (Yuan, Y. et aL (1997) Cell 88:9-11; and Hennet, T. supra) . 

UDP-GaLGk:NAc-l,4-galactosyltransferase (-l,443alT) (Sato, T. et aL, (1997) EMBO J. 
16:1850-1857) cata^zes flie formation of Type II caibobydrate diains with Gal (Pl-4)GlcNAc 
linkages. As is tiie case widi die pl3-galacto53dtrans;forase, a soluble form of die enzyme is formed 
by ckavage of die HKmbrane-bound fonn. Amino adds conserved among pi, 4- 

25 galactosyltransferases include two cysteines linked ibrou^ a disulfide-bonded and a putative UDP- 
galactose-binding site in the catatytic domain (Yadav, S. and Brew, E. (1990) J. BioL Ch^n. 
265:14163-14169; Yadav, S.P. and Brew, K (1991) L KoL Chem. 266:698-703; and Shapes, N.L. et 
aL (1997) J. BioL CbsoL 27231389-31399). pl,4-galactosyltransfcrases have several specialized 
roles in addition to syntiiesizing carbohydrate diains on glycoproteins or gJycoKpids, In mammals a 

30 pi,4-g^lactosyltransfenise, aspartof aheterodinier witha-lactalbumin, to 

mammary gland Lactose production. A pi,4-ga]actos^transferase on ibe surface of sperm functions 
as a rec^)torttiatspedfica% recognizes^ egg. Cell surface pi,4-galactosyitransferases also 
fonction in cell adhc^n, ceB/basal lamina intraaction, and normal and metastatic cell migration. (Shur, 
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B. (1993) Oiix. OpirL Cell BioL 5:854-863; and Sh^r, J. (1995) Adv. Exp. Med. BioL 376:95-104). 
Ghitathionc S-transfcrasc 

The basic reaction catalyzed by ghatatihione S-transfemes (GST) is the conjugatioji of an 
electrophile with reduced ghitathione (GSH)» GSTs are homodimjeric or heteroditnenc proteins 

5 localized mainty in die cytosol, but some level of activity is present in microsomes as weE The major 
isozymes share common stnictm^ and catalytic properdes; in bumans they have been classified into 
four major classes, A^jha, Mu, H, and Theta. lie two largest classes, Al^a and Mu, are id^tified 
by dieir respective protehi isoelectric points; pi - 7.5-9.0 (Alpha), and pi - 6.6 (Mu). Eadi (2ST 
possesses a common binding site for GSH and a variable hydroph^ Thehydroi^obic 

10 btfiding sitft in each isoTyme is speofic for pardcolar elcctrophilic snbstrates. Specific amino acid 
residues widun GSTs have be^ identified as mpoitsaat for these binding sites and for catalytic 
activity. Eesidue& Q67, T68, DlOl, B104, and R131 are hnportant for die binding of GSH (Lee, 
^ aL (1995) J. BioL Chem. 270: 99-109). Residues R13, R20, and R69 arc important for the catalytic 
acti^ of GST (Straibeig G et aL (1991) Biochcm. J. 274: 549-55). 

15 Inmost cases, GSTs pecfomi the beaiefidal fionctionof deactivatbn and detoxification of 

' .pot^itialfy matag^c and cardnogenic dii^xdcals. However, in some cases Ifaeir action is detrimental 
and results in activation of dii^cakvrilb consequent nmtagenica^ Some 
forms of rat and human GSTs are reliable preneoplastic markers that md in the detection of 
• carcinog^ieas. FTpr^'H^" »fT»iiTifiaii r;gTp inimptftfiai gtraing, gnrh as Salmnttftll atvpliimamum used 

20 in die wellrlaiown Ames test for mntagenicity, has lie^>ed to establish fiie role of these enzymes in 
matagjenesis. X>iha]onou^ianes, wfaidi produce liver tumors in mice, are believed to be activated by 
GST. This view is supported by the finding diatdihaloinediancs are mo^ 

exprcssmg human GST than in untransfected ceDs (Thier, R. aL (1993) Proc. NatL Acad. Sd USA 
90: 8567-80). The mutagenicity of e^ykm dibromide and ethjd^ie didhlotide is increased in bacterial 

25 cells e:xpressing die human Al^ha GST, Al-1, while die mutagenicity of aflatoxin Bl is substantially 
reduced by enhancing d^ expression of GST (Simula, T.P. et aL (1993) Carcmogenesis 14: 1371-6). 
Thus, control of GST activitymay be usdW in die coutiol of mutagqnesis and cardnogenesis. 

GST has been implicated in die acquired resistance of many canc^ to drug treatment, the 
l^henomenon known as muM-drug resistance QMDR). MDR occurs when a cancer patient is treated 

30 with a cytotoxic drug such as cyclopbosphamide and subsequently becomes resistant to this dmg and 
to a vari^ of odier cytotoxic agfaits as welL increased GST levels are associated widi some of 
these drugresistant caucus, and it is believed diat this increase occurs in response to die drug agent 
whkii is thra deactivated by die GST catatyzed GSH corgugation reaction. The hicreascd GST levels 
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then protect the cancer cells from other cytotoxic agents whidi bind to GST. Increased levels of 
Al-l in tumors has been linked to drag resistance induced by cyclophosphamide treatment (Dirven 
H.A. et aL (1994) Cancer Res. 54: 6215-20). Thus control of GST activity m cancerous tissues may 
be nseftil in treating MDR in cancer patients. 
5 Gamma-ghitamvl transpetytidase 

Gamma-ghitam^ transp^tidases are ubiquitously expressed enzymes that initiate extracellular 
glutathione (GSH) breakdown by cleaving gamma-ghitamyl amide bonds. The breakdown of GSH 
provides cells with a regional cyst^e pool for biosynthetic pathways* Gamroa-ghitamyl 
transpeptidases also contdbute to cellular antioxidant defenses and expression is induced by oxidative 

10 stetess. lire cell sutface-localized glycoprotEjiis.are e3q)ress Studies 
have suggested that the hi^ level of gamtna-^tamyl transpeptidases activity present on the surface 
of cancer cells could be exploited to activate precursor dro g$, resulting in lugh to cal concentrations of 
anti-cancer flierapentic agents (Hanigan> M.H. (1998) Chem. Biol Interact 111-112:333-42; 
Tanigudii, N. and Itcda, Y. (1998) Adv. Enrjnnol Relat Areas Mol BioL 72:239-78; Chikhi, N. et aL 

15 (1999) Comp. Biodian. VbyuoL B. Biochem. Mol BioL 122:367-80). 
Ac^transferase 

N-ac^ltran&ferase exxzymes catalyze the transfer of an amino acid conjugate to an activated 
carbo3^c group. Endog^ns componnds and xenobiotics are activated by acyl-Co A synthetases in 
die cytDsol, microsomes^ and mitodiondria. Ihe acyl-CoA intermediates are <hen conjugated widi an 

20 amino add (typically gtycdne, ghitamine» or taurine, but also omithine, argunne, histidine, sorine, aspartic 
add, and several dipeptides) by N-ac^transfetases in the cytosol or mitodiondiia to form a metabolite 
wilh an amide bond. Uns reaction is con^lementary to Q-gtocuronidation, but amino acid conjugation 
does not prodoce die reactive and toxic metabolites whidx often result £rom ghicuronidation. 

One well-characterized csaymo of this class is die bile acid~CoA:amino add N~acyltrans^:ase 

25 (BAT) responsible for gpn^rating the bile add conjugates whidi serve as detergents in the 

gastrointestinal tract (Falany, Q K et aL (1994) L Biol Ghem, 269:19375-9; Johnson, M. R, et aL 
(1991) J. BioL Chem. 266:10227-33). BAT is also useful as a predictive indicator for prognosis of 
hepatoceltnlar carcinoma patieaits after partial hepatectomy (Fiaratani, M. et aL (1996) H^mtology 
24:1441-5). 

30 Acetvltransiefases 

Ace^transf^rases have been ejrtensively studied for thdr role in histone Histone 
ace^^on results in tiie relaxing of tibe diromatin structure in eakaryotic ceOs, allowing transcription 
foctors to gain access to promoteir demesits of die DNA ten^lates in the affected region of die 
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genome (or tlK genome in general). In contrast, histone deacetylation results in a reduction in. 
transcription by closing tibc cfatomatin stmcttuc and limiting access of transmption factors. To this 
end, a common means of sdmnlating cell transcriptian is the use of chranical agents Ifaat inhibit Ifae 
deacetylation of liistones (e*g., sodium butyrate), resulting in a global (albeit aitiEactual) increase in 

5 g^e esipression. Ihe modulatLon of gene expression by acet^lation also results fi^ 

of other proteins, including but not limited to, p53, GATA-1, MyoD, ACTR, TFDB, TFIIF and flie hi^ 
mobility group proteins (HMG). In the case of pS3,acetylation results in increased DNA binding, 
leadingtoth&sttnondationoftramcdptionofgesiesregplatedbypS The prototy^nc histone acetylase 
CHAT) is GcnS from Sacdtaromvces ootevisiae. GcnSisamemb^of afamilyof acetylasesthat 

10 incbdes Tetrahymc^p5S» human GonS, and human p3()0/^ Histone acetylation is revbwed in 
(Cheung, WX. et aL ^000) Curr. Opin. OB BioL 12326-333 and Berger, S.L (1999) Curr. Qpn. CeH 
Biol 11:336-341). Some acetyltransfisrase enTj^mes posses die al^ha/beta hydrolase fold (Obiter of 
Applied Molecular Eqgineering Inst of Chemisttyaod Biochemistry -XMversfty of Salzbiiig, 
htlp:/^redictsang^.ac.u]EA]l>m'<»urse97/DocsAnsO common to several other major classes of 

15 enzymes, including but not limited to, acet^chohnestorases and carboxylesterases (Structural 
Cnas^ficadon of Proteins, ht^://scopaiUK>lnib.cam.aauk/scopfindexhtn^^ 
. N-ac^^iatransfsrase 

Aromatic amines and hydrazine-containitig conqmunds are subject to N-acetylation by flie N- 
acet/lttansferase en^mes of livi^ and othor tissues. Some xenobiolics can be O-acetylated to some 

20 extent by the same ^izymes. N-ace^ltrans^iises are cytosolic enzymes which uttli?^ 

acetyl-coenzyme A (acetyl-CoA) to transfer ihe acetyl gronp in a two step process. In the first step; 
the acetyl group is transferred from ac^l-Co A to an active site cystdme residue; in fiie second step» 
the acetyl group is transferred to the substrate amino group and die ^izyme is regenerated. 
In contrast to most ods&r DMB classes, diere are a limited number of Inown N- 

25 acetyltransferases. In humans, there are two highfy similar Mizymes, NATl and NAT2; mice appear 
tohaveathirdformof 1heenz^e,NAT3. The human forms of N-acetyhxansferase have 
independent regulation (NATl is widely-expressed , whereas N AT2 is in liv«r and gut only) and 
oY^lapping substrate pref^mces. Both enzymes appear to accept most substrates to some extent, 
but NATl does pref^ some substrates (para-aminobenzoic add, para-aminosalicylic acid, 

30 sulfamethoxazole, and sulfanilamide), while NA12 prefers others (isoniazid, hydralazine, procainamide, 
da{>sone, aminoghitethimide, and sulfamethazine). 

Cfinical observations of patients taking die antituberculosis drug isoniazid in the 1950s kd to die 
deso^qptionoffestandslowacetylatorsof thecompoui^ These phenotypesw^ shown 
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subsequently to be due to mutations in the NAT2 gene which affected eaizyme activity or stability. 
The slow isoniazid acetylator phenotype is very prevalent in Middle Eastern populations (approx. 
70%), and is less prevalent in Caucasian (approx. 50%) and Asian (<25%) populations. More 
rec^tily, functional polymorphism in NATl has been detected, wilh approximately 8% of the 
5 population tested showing a slow acetylator phraiotype (Butchea*, N. J. et aL (1998) Pharmacogenetics 
8:67-72). Since NATl can activate some knovm aromatic amine caidnogens, polymorphism in the 
widely-expressed NATl enzyme may be important in detmnining cancer risk (OMM *108345 N- 
acet^transfefase 1). 
Aminotra nsfaases 

10 Aminotransfinrascs conqirise a &mily of pyddoxal 5 -phosphate CE^LP) -dep^id^ ^izymes that 

catalyzetransfbrmationsof ammo adds. Aspartate aminotransferase (AspAT) is the most 
extraisiyely studied PXJP-containitig esjayniQ. It catai^^zes the reversible transamination of dicarboxyfic 
Lr-amino adds» aspartate and gbtamate, and ttie corresponding 2-oxo adds, oxakcetate and 
2*-oxoghitarate. OAer members of the fytnSty included pymvate aminotransferase, brandied-chain 

15 ' ammo add anunotransferase, tyrosine amiDotransfiraiase, aromatic aminotransfoase, aiiaDinerglyoxylate 
aminotnuisfbrase (ACH^, andl7aar«Qin& aminotrans:^^ (Vacca,ILA. et aL (1997) J. BioL Cbem. 
272i21932-21937). 

Pmnaty hyperoxahria ^pe-1 is an autosomal recessive disorder re^ 
liver-spedfic perosdsomal enzyme, alaninerglyDxylate aminotransferase-l. The phenotype of the 

20 disord^isadefidencyin^yoxylateni^abolisnL In the absence of AGT,gLyo3gdate is oxidized to 
oxalate rather than bdngtraosatninated to glyctne. The result is the deposition of insoluble caldum 
oxalate in ^ kidneys and minaty tract, ultimately caniang lesial faibie (Lunib, M J. et aL (1999) J. 
BioL C3ienL 274:20587-20596). 

Kynurenine aminotransf^ase cata^zes the irreversible transamination of the L-trj^tophan 

25 metabolite Lr-kynurenine to form kyunrmc acid. The enzyme may also catalyzes the reversible 

transamination reaction b^ween Lr2-aminDadipate and 2-oxogIutarate to produce 2-oxoadipate and 
L^gitatamate. Kynure^ add is a putative modulator of ghitamatergic neurotransmission, thus a 
defidency in kynurenine aminotransfi^iise may be associated with pleotrophic e:^cts (Bndili, R. et 
aL (1995) J- BioL Ch^ 270:29330-29335). 

30 Co pper-zinc superoxide dismutases 

(}oppCT-ztnc sup^oxide dismutases are con]^)act homodimeric mctaHoenzymes involved in 
c^hdar defcaises against oxidative damage- The ^izymes contain one atom of zinc and one atom of 
copper per subunit and catalyze the dismntation of superoxide anions into O2 The rate of 
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disnmtation is diffusion-linuted and consequently enhanced by the presence of favorable elecfiostatLc 
int^iactions between &e substrate and eaazyme active site. Examples of fliis class of enzyme have 
been identified in the cytoplasm of all the enkaiyotic ceBs as well as in the periplasm of several 
bacterial spedes. Copper-zinc sapexoxide dismatases are robust enzymes that are highly resistant to 

5 proteolytic digestion and d^iatnring by urea and SDS. In addition to the compact stmctare of the 
enzymes, tiie pres^ice of Ate metal ions and intrasubmut disulfide bonds is believed to be responsible 
for enzyme stability. IlieaoEzymesmidergo reversible d^iatniadon at t^^ 
(Battistoid, A. et aL (1998) J. BioL Chem. 273:5655-5661). 

Overexpression of superoxide disnmtase has been impficated in enhancLag freezing tokiance of 

10 transgenic Alialfa as well as providiog resistance to ^vironmental toxins sndbi as ibc dtphenyl efiier 
herbicide, adfhiorfen (McKearsie, B.D. et aL (1993) Plant Physiol 103:1155-1163). In addtion, yeast 
ceUs become more resistant to fce^se^Chaw damage following eaposore to hydrogen peroxide whidi 
causes &e yeast ceSs to adapt to foriher peroxide stress by upreg^lating e:q»:ession of superoxide 
dismntases. fo this study, mntations to yeast supeinxidedismntasegpnes had a more deltime 

15 eEGcct OEU £reez&-diaw resistance than noatations wiiidh a^cted fiie regulation of ghitathione 
metabolism, Ismg suspected of bdng inqwrtant in determinmg an organisms survival throng the 
process of ctyopreservation (Jong-la Paric, J-L et aL (1998) J. BioL Chem. 273:22921-22928). 

Expression of superoxide dismntase is abo assodated wifli Mycobacterium tuberculosis, the 
organism foat causes tuberculosis. Superoxide disnmtase is one of the tea major protdns secreted by 

20 M. tuberculosis and its expression is upregolated approximately 5-fold in response to oxidative stress. 
M. tuberculosis expresses almost two orders of magnitude more superoxide disnmtase &an the 
noDjiafliogettic mycobacterium NL smegmatis , and sec^tes a much hi^ier proportion of the e:qn:essed 
enzyme. The result is flie secretion of ~350"fold more oizvme by M. tuberculosis than M. smegmatis , 
providing substantial resistance to oxidative stress (Harth, G. and Hbrwitz, MA. (1999) J. BioL Chem. 

25 274:4281-4292), 

The reduced expression of copp^-zinc superoxide dismntases, as well as other enzymes with 
anti-oxidant csqpabibties, has been impScated in tibo earty stages of cancer. The e35>rcssion of copper- 
zinc superoxide dismutases has been shown to be lower in prostatic intra^ithelial neoplasia and 
prostate carctnomas, compared to normal prostate tissue (Bostwick, D.G. (2000) Cancer 89:123-134). 
30 Phosphodiesterases 

Pliosphodibst^ases make up a class of en^mes which catalyze the hydrolysis of one of the 
two ester bonds in a i^osphodiester compound. Phosphodiesterases are therefore crucial to a variety 
of celhilar processes. Phosphodiesterases indnde DNA and RNA endonucleases and exonuclaases. 
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wMcli arc essential for cell growth and replication, and topoisoinrarases, which break and rejoin nucleic 
add strands during topological rearrangement of DNA. A Tyr-DNA phosphodiesterase functions in 
DNA repair by bydrolyzing dead-end covalent intermediates formed between topoisomerase I and 
DNA (Pouliot, J. J. et al. (1999) Science 286:552-555; Yang, S.-W. (1996) Eroc. NatL Acad Sol 

5 USA 93:11534-11539). 

Acid sphingomyelinase is a phosphodiesterase which hydrolyzes the membrane phospholipid 
sphingomyelin to produce coramide and phosphorylchoHne. Fhosphorylcholine is used in the synthesis 
of phosphatidylaholine, whidi is involved in nmnerons intracellular signaling pathways^ whOe coramide 
is an essential precursor for the generation of ganghosides, membrane Hpids found in high 

10 concentration in neural tissue. Defective add sphingomyelinase leads to a build-up of sphingpmyelin 
molecules m lysosomes, resulting in Niotnann-Kck disease (Sdmchman, BJBL and S.R. Miranda 
(1997) Genet Test 1:13-19). 

Glycerophosphoryl diester phosphodiest^:ase (ako Imown as glycerophospbodkster 
phosphodiesterase) is a phosphodiest^-asc which hydrolyzes deacetykted phospholipid 

15 gtycetophosphodiesteis to produce sn-glycerol->3^ito Gfycocofhosphodioline^ 
gftycerophosphoelhanDlaimne, glycerophosphogl^fcerol, and glycetophosphoinositol ate examples of 
substrates for glycocophosphoiyl diesfc^ phosphodiesterases. A glycerophosphoxyl diester 
pbosphodiestBrase fiom B. coH has bxoad spedfidty for glyceFophosfdiodiester substrates ([.atson, 
TX et aL (1983) J. Biol Chem. 248:5428-5432). 

20 Cyclic nucleotide phosphodiest^ases (PDBs) are crudal en^s^mes in the regulation of the cyclic 

nucleotides cAMP and cGMP. cAMP and cCSMP function as intracellttlar second messenger 
transduce a variety of extr^^eBular dgoals including hormones^ light, and neurotransmitt^. FDBs 
degrade cyclic nucleotides to dieir conesponding monoj^osphates, thereby regulating the intracellular 
conceiitrationsof cydicnudeotidesanddidrefiB^ Due to fhek roles as 

25 regulators of sagoal transduction, FDBs have been eiEteusively studied as diemofiii^peutic targ^ 
(Pfeoy, MJ. and GA, Higgs (1998) Can. Opin. Chem. Biol 2:472-481; Toiphy, J.T. (1998) Am. J. 
Rcsp. Crit Care Med. 157-351-370). 

Families of mammalian PDEs have heen classified hased on Iheir suhstrate spedfid^ and 
affinity, sensitivity to cofactois, and sensitivity to inhibitory agents (Beavo, J.A. (1995) PhysioL Rev. 

30 75:725-748; C3pnti, M. et al (1995)_Endqciine Rev. 16:370-389). Several of these families cpnt^ 
distinct g^i^, many of v^ch are eigiressed in different tissues as spU^ 

families, diexe are multtple isozymes and multiple splice variants of these isozymes (Conti, M. and S.- 
L.C Jin (1999) Ftog. NucHc Add Res. Mot BioL 63:1-38). The existence of multiple FD£ families. 
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isozymes, and splice variants is an indication of the variety and complexity of the regulatory pathways 
involving c^Hc nucleotides (Houslay, MD. and G. MiDigan (1997) Trends Bioch^ Sd. 22:217-224). 

Type 1 FDEs (PDBls) are Ca^/cahnodulin-depCTident and appear to l?e encoded by at least 
three different genes, each having at least two different splice variants (Kakkar, R. et al (1999) Cell 
5 Mol. life Sd. 55:1164-1 186). PDEls have been fbnnd in the hing, heart, and brain. Some PDEl 
isozymes are regulated in vitro by phosphorylation/dephosi^orylatioiL Phosphorylation of these PDEl 
isozymes decreases the affinity of the enzyme for calmodulin, decreases PDE activity, and increases 
steady state levels of cAMP (Kakkar, sapra) . PDEls may provide nseM thearapeutic targets for 
disorders of tiie cc^otral nervous system, and the caidiovascolar and inmmne systems due to fli© 
10 mvolvement of PDBls in both cyclic nucleotide and caldum signaling (POTy, MJ. and G.A. Higgs 
(1998) Curr. Opin. Obean. BioL 2:472-481). 

PDE2s are cGMP-stiraolated EDEs that have been found in tiie cerebeDnm, neocortex, heart, 
kidney, lung, pulmonary artery* and skeletal mnscle (Sadhn, K. ^ aL (1999) J. Hstochem. Gyfochem. 
47:895-906). PDB2s are tiiongjit to mediate the efltets of cAMP on catecholamine sectetion, 
15 participate in the r^olation of aldost^ne (Beavo, snpra) , and play a role in olfactDry signal 
transducrion (Jnilfe, D.M. et aL (1997) Ptoc. NatL Acad. Sd. USA 94:3388-3395). 

PDE3s have high affinity for botii cG^^P and cAMP, and so Ihese cycfic nucl^ 
con^petitive sabstrates for PDE3s. PDE3s play roles in stimulating myocardial conlractilily, inhibiting 
platebt aggregation, relaxing vascular and airway smootii muscle, inhibitiiig proliferation of T- 
20 lynqKhocytes and cultured vascular smootii mnsde ceSs, and regulating catecholamme-indnced release 
offree fotly acids fiom adipose tissue. The PDE3 £amify of phosphodiesterases are sensi^ 
specific inhibrb(»:s such as dObstanude, enoxinaone, and Bxazmone. Isozymes of PDE3 can be 
regulated by cAMP-depea^tea protein kinase, or by insuBn-dcpendcnt kinases (Degeiman, R ^ aL 
(1997) J. BioL Chem. 272:6823-6826). 
25 PDE4s are specific for cAMP; are localized to airway smooth muscle, the vascular 

endothelium, and all inflammatory ceBs; and can be activated by cAMP-dependent phosphorylation. 
Since elevation of cAMP kvels can lead to suppression of inflammatory cell activation and to 
relaxation of bronchial smootii musck, PDE4s have been studied extensively as possible targets for 
novel anti-inflainmatory agents, with special emphasis placed on the discovery of asthma treatments. 
30 FDB4 inhibitors are currently undergoing clinical trials as treatroMits for asthma, chronic obstructive 
pulmonary disease, and atojHc eczema. All four known isozymes of PDE4 are susceptible to the 
inhibitor rolptam, a compound whidh has been shown to inqirove behavioral mranory in mice (Barad, 
M. et aL (1998) Proc NatL Acad. ScL USA 95:15020-15025). PDE4 inhibitors have also been 
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studied as possible thcarapcwtic ageaits against acute lung injvuy, endotoxemia, Aeomatoid arthritis, 
multiple sclerosis, and various neurological and gastrointestinal indk:ations (Doherly, A.M. (1999) 
Curr. Opin. Chemu Biol 3:466-473). 

PDB5 is hi^ selective for cGMP as a substrate (Turko, LV. et aL (1998) Biochemistry 

5 37:4200-4205), and has two aUosteric cGMP-specific binding sites (McAllister-Lucas, L.M. et aL 
(1995) J. Biol Chem. 270:30671-30679). Binding of cGMP to fliese allosteric binding ates seems to 
be in^rCant for phosphorylation of IDES by cGMP-dep«tident protdn kinase rafiu^ than for direct 
tegplation of catalytic activity. Hi^ tevels of IDE5 are found in vascular smootfiinascl&,platel^ 
hmg, and Mdney. The inhibitor zaidnast is effective against IDES and ^ ModificatioiLof 

1 0 zaprinast to provide specificity against FDE5 has resulted in sildenafil (VIAGRA ; Pfizer, Inc. , New 
Yolk NY), a treatment for male erectile dysfunction (Tot^ N. et at (1996) Bioorg. Med. Chem. 
Lett 6:1819-1824). Inhibitois of PDE5 are currently bdng studied as ag^ for cardiovascular 
flierapy (Perry, MJ. and GA. Higgs (1998) Cuir. Opin. Chem. Biol 2:472-481). 

PD£6s, Ac photoreceptor cyclic nucfeotide phosphodiesterases, are crucial components of die 

15 j^ototransdoction cascade. Inassoduationwiththe&proteintransdudn,PDE6shydto^^ 

regulate cGKfP-gated cation channels in photorec^tor membranes. In addition to tiie cGKfP-bindtng 
active site, PDE6s. also have two hi^-aGBnity cGMP-bindmg sites whidi are thought to play a 
regnlatory rofc in FDE6 function (Artemyev, N.O. et al (1998) Mefliods 14:93-104). Defects in 
PDB6s have been associatBd with retinal disease. Retinal degeneration in the rd mouse (Yan, W. et 

20 aL (1998) Invest OpthalmoL Vis. ScL 39:2529-2536), autosomal recessive retinitis pigmentosa in 
humans (Danciger, M. et aL (1995) Genomics 30:1-7), and rod/cone dysplasia 1 in hish Setter dogs 
(Sttber,Mi.^aL (1993) Proc. Nad. Acad Sd.IJSA 90:3968-3972) have beea attributed to 
mutations in die PDE6B g^e. 

ThePDET fiamEty of PDBs consists of only one known m^nber having multipfe splice variants 

25 (Bloom, T J. and J.A. Beavo (1996) Proc. Natl. Acad. ScL USA 93:14188-14192). PDE7s are 

cAMP spedfic, but littb else is known about their 0iysiolo^cal function. Atthough mFNAs encoding 
PDE75 are found in skeletal muscle, heart, bram, lung, Iddney^ 

proteins is restricted to spaafic tissue types (Han, P. et aL (1997) J. BioL Chem. 272:16152-16157; 
Pfcrry, M J, and G.A. Higgs (1998) Curr. Opin. Chem. BioL 2:472-481). PDE7s are very closely 
30 related to the PDE4 family; however, PDE7s are not inhibited by rolipram, a spedfic inhibitor of 
PDE4s fBcavo, supra) , 

IT)E8s are cAMP specific, and are closety related to fl^PDB4feimly. PDE8s are eapressed 
inlhyroid gland, testis, eye^]iv^,skektalrinisclB, heart, M ThecAMP- 
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hydrolyzing activity of PDE8s is not inhibited by Ihe PDE inhibitors rolipram, vinpocetine, milrinone, 
IBMX (3-isobntyl-l-methylxaiilhinB), or zaprinast, but PDE8s are inhibited by dipyridamole (Fisher, 
D.A. et aL (1998) Biochem. Biophys. Res. Commnn. 246570-577; Hayashi, M. et aL (1998) 
Biochem. Biophys. Res. Commnn. 250:751-756; Soderling, S.H. et aL (1998) Proc. NatL Acad. ScL 
5 USA 95:8991-8996). 

PDE9s are cGMP spedfic and most closely resemble flie PDE8 family of PDEs. PDE9s are 
expressed in kidney, Hver, long, brain, spleen, and small intestine. FD£9s ate not inhibited by sildenafil 
(VIAGRA; Pfizer, Lac, New Yoj± NY), rolipram, vinpocetine, dq»yddamol&, or IBMX (3-isobotyl-l- 
mediil^xanfliinc), but fiiey are sensitive to flie FDB5 inhibitor zffpnaast (Hsher, D. A. et aL (1998) J. 
10 BioL Ch^. 273:15559-15564; Soderimg, S.BL et aL (1998) J. BioL Chem. 273:15553-15558). 

FDE10saiedaa]-sabstrateFDBs,bydiolyzmg both cAMP and FDElOs aie expressed 

inbi^ fiiytoid, and testis. (Sodding, S.H. et aL (1999) Proc. NatL Acad. Sou USA 96:7071-7076; 
Fujishige, K et al. (1999) J. BioL Qran. 274:18438-18445; Lon^y, K et al (1999) Gtene 234:109- 
117). 

15 PDEs are composed of a catalytic domain of about 270-300 ammo acids» anN-terminal 

regulatory domain responsible forbindii^ cofactors, and, in some cases, abydrophiKc C-tenninal 
dconam of mdmown fnnctian (Conti, M. and S.-L.C Jin (1999) Prog. Nucteic Acid Res. MoL BioL 
63:1-38). Aa>nserved,putatiYeziiic4)jndingmotif,HDXXIK(S^^ 

catalytic domain of all PDEs. N-t^minal regulatory domains include non-catalytic cGMP-binding 
20 domains in PDB2s, PD£5s, and PDE6s; calmodalinrbinding domains in PDEls; and domains 
containing phosphorylation sites in PDE3s and PDB4s. In PDBS, the N-terminal cGMP-binding 
domain spans about 380 amino add residues and comprises tandem repeats of the conserved 
sequence motif N(R/K)XiiFXy[)B (McADister-Lucas, L.M. et aL (1993) J. BioL C3ienL 268:22863- 
22873). The NEXnDmotifhasbe^ drown by mutag^iesis to be important for cG^^ 
25 (Tuiko, LV. et aL (1996) J. BioL Chem. 271:22240-22244). PDE families display approximately 30% 
amino add identity within tiie catalytic domain; however, isozymes within die same family typicaDy 
display abont 85-95% identity in tibis region (e.g. PDE4A vs PDE4B). Fiirflxermore, within a famify 
diere is extensive similarity (>60%) outside the catalytic domain; while across families, there is litde or 
no sequence similanty outside this domain. 
30 Many of the constitn^ftmctions of immune and inflanmatory responses are inhft 

agents that increase intraceBalar levels of cAMP (V^r^iese, M.W. et aL (1995) MoL PhannacoL 
47:1164-1171). Avarietyof diseases have been attributed to increased PDE activity and assodated - 
witii deoreasedl&ve^ of cyc£c nucleotides. For ^campfe, a form of diabetes insipidus in mice has 
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been associated with increased PDE4 activity, an increase in low-K„ cAMP PDE actrvity has been 
reported in leukocytes of atopic patients, and PDE3 has been associated with cardiac disease. 

Many inhibitors of PDEs have been identified and have undergone clinical evaluation (Peny, 
MJ. and CA. EBggs (1998) Out. Opin. Chem. BioL 2:472-481; Torphy, TJ. (1998) Am. X Respir. 
5 Grit. Care Med, 157:351-370). PDE3 inhibitors are being developed as antithrombotic agents, 
antihypertensive agpnts, and as cardiotoBic agpnts useful in fb& treatment of congestive heart failure. 
Rolipram, aPD£4i]]lubitor,lia5beeniisedintheti:ea!mentof depre^ 

am nttdftrgniiig ftvftlnntinn ns anti -tiiflammato Ty agents. Rolipram has also been shown to inhibit 
lipopolysacdbaiide (LPS) induced TNF-a which has been shown to ^ihaQcc HIV-1 lepfication in vitro . 
10 Ihorefoie, loHpram may inhibit HLV-1 icplicatiDn (Angel, LB. et at (1995) AIDS 9:1137-1144). 

Additionally, loHpram, based on its ability to suppress the production of c^ftoldnes sodi as IT4F-a and b 
and intermixing, has been shown to be ^fective in Che ti«at^ Rohprammay 
also be effective in treatii^ tardive dyskinesia and was effect 

espmmental animal model (Somm^, R et aL (199S) Nat Med. l'244-248; Sasaki, EL et al (1995) 

15 Eur. J. PharmacoL 282:71-76). 

Theophylline is anonspecific PDE inMbitor used in the treatment of bronchial asthma and otiier 
respiratory diseases. Theophylline is believed to act on airway sinootbrnoscle function and in an 
anti-inftammatory or immanomodulatory capaaXy in ftub treatment of respiratory diseases (Banner, 
JLK and CP. Pagp (1995) Eur. Respir. J. 8:996-1000), PentoxifyUme is another nonspechSc PDE 

20 inhibitor nsed in tiie treatment of intermittent chmdication and diabetes-induced peripheral vascular 
disease. P^tttoxi^Iline is also loMiwn to block TNF'-aiM^ 
(Angel et al> supra> . 

PDEs have been rqx>rted to affect cellnlar proliferation of a variety of cell types (Conti et aL 
(1995) Endocrine Rev. 16:370-389) and have be«i implicated in various cancers. Growth of prostate 

25 cardnoma cell lines DU145 and LNCaP was inhibited by delivery of cAMP derivatives and PDE 
inhibitors (Bang, YJ. et aL (1994) Proc. NatL Acad. ScL USA 91:5330-5334). These cdls also 
showed a pemiai^nt conversion in ph^otype from ^ithelial to neuronal morphology. It has also been 
suggested that PDE inhibitors have die potential to regulate mesangial cell proliferation (Matonsovic, 
K. et aL (1995) J. Clin. Invest 96:401-410) and lymphocyte proliferation (Joulain, C et al. (1995) J. 

30 Lipid Medial Cell Signal. 11:63-79). A cancer treatment has been described that involves intraceBnlar 
deliv^ of PDEs to particular cellular compartm^its of tumors, resulting in cell death (Deonarain, 
M P. and AuA. Epeaaetos (1994) Br. J. Cancer 70:786-794). 
Pbosphotriesterases 
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FiiospliotnesteTases (PTE, paraoxonases) arc enzymes that hydrotyze toxic organophosplioros 
compoimds and liave been isolated ftom a variety of tissues. Tbe &myxoe& appear to be lacking in 
birds and insects and abundant in manunals, e^lain the reduced tolerance of birds and insects to 
organophosphorus compound (Wanova, E, and Sogorb, M.A. (1999) Crit Rev. Toxicol 29:21-57). 

5 Hiospliotriesterases play a central role in the detoxification of insecdcides by mammals. 

Fhosphotriesterase activity varies among individuals and is lower in infants Iban adults. Knockout 
mice are markedty more s^isitive to the organopbosphate-based toxins diazoxon and dilorpyrifos oxon 
(Furlong, CR, et al (2000) Neurotoxicology 21:91-100). PTEs have attracted interest as enzymes 
capable of the detoxification of organophosphate-containing ch^nical waste and warfare reagents 

10 (e.g., paratbion), in addition to pesticides and insecticides. Sonoe studies have also implicated 
phosphotdesteiase in atherosclerosis and diseases involving lipoprotdn metabolism. 
Tfaioesterases 

Two soluble dnoesterases involved in fatty add biosynOicsis have beau isolated from 
mammafian tissues, one which is active onl^ toward long-diam fat^acyl tbdoestm and one which is 
15 acth^c toward Moest^ with a vide rangp of &tty-acylchain-I^^ These diioesterases catalyze 

flift cbanwtarminfltjtig step tti the A? nnvn biosynftiesis of fatty acid5s. Chain tmmnation involves tibe 
hydroly^ of flie thioester bond wMch links fbe &tty acyl chain to the 4 -phosphopantedieine prosthetic 
group of ftie acyl camer protdn (ACP) subunit of die £atty add synthase (Smithy S. (1981a) Methods 
Enzymol 71:181-188; Smith, S. (1981b) Methods ^izymol 71:188-200). 
^ S- coK contains two soluble thioesterases, fliioesterase I whidi is active onty toward long-chain 

ac;^ tiuoestm, and Ihioesfcrarase n (TEQ) wUdi has a 1m>ad diain-fen^ 

(1991) J. Biol Gb^ 266:11044-1 10%). g. og£ TEH does not edubit sequence dnoilarity with eitihar 
of die two types of mammailian tiuoesteiases which function as chain-temmiating enzymes in de novo 
fatty add biosyntiiesis. LtoKke the mannnalian to^esterases, B. coli TEH lacks the characteristic 

25 serine active site 0y-X-ser-X-giy sequence motif and is not inactivated by flie serine modifying agent 
^isopropylfhiorophosphate. However, modification of histidine 58 by iodoacetamide and 
diediy^yrocarbonate abohslK^d TEU activity. Ov^xpressionofTEQ did not alter fatty add content 
in E. colL which suggests diat it does not function as a diain-tertxunating ^tzynoe in fatty add 
biosynthesis (Naggert et al, supra) . For that reason^ Naggert et al, (supra) proposed that the 

30 t^ysiplogical substrates for E. T^ may be coenzj^ A (CoA>-fatty add esters instead of ACP- 
|dK>si^panth^iein&-fatty add esters. 
Carfaoxylestcrascs 

Mammalian carboxj^esterases constttute a nmltig^ faonify expressed in a variety of tissues 
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and cell ^cs. Isozymes have significant seqaence homology and are classified primarily on the basis 
of amino add seqaence. Acet^holinesterase, butyrylcholinesterase, and carboxylesterase are 
grouped into the serine super famdly of esterases (B-esterases). Other carboxylesterases included 
thyroglobulin^ thrombin. Factor IX, ghotactin, and plasminogen. Carboxylesterases catalyze the 

5 hydrolysis of ester- and amide- groups fix>m molecules and are involved in detoxification of drugs, 
environmental toxins, and carcinogens. Substrates for carboxylesterases include short- and long-chain 
acyl-glycerols, acylcamitine, carbonates, dipivefimhydrodbloride, cocaine, salicylate, capsaicin, 
pahmtoyl-K:oenzyme A, imidapril, haloperidol, pyrrolizidine alkaloids, steroids, p-nitrophenyl acetate, 
malathion, butanilicaine, and isocarboxaside. The emysmsA oftmx d^nonstrate low substrate 

10 specificity. Carbox^bsteiases are also important for the conversion of prodrugs to Ifa^ 

fi«e adds, whidi maybe flie active form of the drug (e,g., lovastatin, used to bwer blood cholest^l) 
(reviewed in Satoh, T. and Hbsokawa, M. (1998) Amm. Rev. Pharmacol ToxicoL38:257-288). 

Neuroligms are a class of nK>lecuks that (i) have N-terminal sign^ 
ceQ-surfEce receptors, (iii) contain carboxyfest^rase domains, (iv) are higbfy expressed in tbe brain, 

15 and(v)bindtDnearexinsinaca]ciam-dependeDtmamier. Desfrite the homology to carboxylesterases, 
nsuroligm^ lack: &e active site serine lesidne, implyfaig a role in substrate binding raflier than catalysis 
(Iditchenko, EL et aL (1996) J. Biol Oasm. 27 1^2676-2682). 
Sciualene epoxidase 

Squalen epoxidase (squalene monooxygenase, SB) is a nddcrosomal nii«mLbran&-bound, FAD- 
20 dependent oxidoreductase diat catal^;es die first oxygenation step in the sterol biosynthetic padiway 
of Gukaryoticcells. aK>]esten>lis anessesittalstrnctandcoii^nentof 

acquired via file IJ>Lrecepfcor-inediatedpa&way or file bi M die latter case, all 

27 caibonal»nis in ftecholeslcrolniolecale are derived SB 
coirverts sqaakne to 23(S)-oxidosqQalcne, which is th^ converted to lanosterol and l^en cholesteroL 

25 The steps involved in chol&st^l biosymhesis ate sonmiaiized below (Stryer, L (1988) Biocbemistry, 
W.H Freeman and Co., Inc. New York. pp. 554-560 and Safcakibara, J. et al (1995) 270:17-20): 
acetate (from Acetyl-CoA) S-hydoxy-S-rm^^-ghitaryl CoA - mevalonate - 5^|^sphomovalonate 
- S^yrophosphomovalonate - isopesitenyl pyrophosphate - dunethylallyl pyrophosphate - geranyl 
pyrophosphate £amesyl pyrophosphate - squalene squalene epoxide - lanosterol -* cholesterol 

30 While cholesterol is esse^al for the viability of eukaryotic cells, inordinately hi^ serum 

cholesterol levi^ results in ^e formation of atherosclerotic plaqaes in the arteries of higher organisms. 
This deposition of hi^ily insolablb lipid mat^jal onto the walls of ess^idal blood vessels (e.g., coronary 
arteries) results in dcoreased blood flow and potraitial necrosis of the tissues deprived of adequate 
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blood flow. HMG-CoA reductase is responsible for the conversion of 3-liydroxyl-3-methyl-glntaryl 
CoA (HMG-CoA) to mevalonate, which represents the first committed step in cholesterol 
biosynthesis. HMG-CoA is the target of a number of pharmaceutical compounds designed to lower 
plasma cholesterol levels. However, inhibition of MHG-CoA also results in Ibe reduced synthesis of 

5 non-sterol intemicdiatcs (e.g., mevalonate) required for other biochemical pathways. SE catalyzes a 
rate-limiting reaction that occurs later in the sterol synthesis pathway and cholesterol in the only end 
product of die pathway following the step catalys^ed by SR As a result, SE is the ideal targpt for the 
design of anti-hyperlipid^nic drug^ that do not cause a reduction in other necessary intennediates 
(Nakamura,Y.etaL(1996) 271:8053-8056). 

10 E poxide hydrolases 

Epoxide hydrolases catalyze the addition of water to epoxide-contaming compounds, lli^ieby 
hydrolyzmg epoxides to dieir cottesponding 1,2-diols. They are related to bacterial haloalkane 
dehalogeaiases and show sequence similarily to oth^mmibeas of the o^p hydrolase fold femily of 
enzymes (eg., bromopmxidasc A2 fipwn Strgptomvccs aareofadcns . hydrosofmiQconic seoualddhyde 

15 hydiplases from Psendomonas tiutida. and haloalkane dehailogaiase from Xanthobacter 
antottoiihicns) . Epoxide hydrolases ace ubiquitous in iiatore and have bee^foimd in inano^ 
invertebrates, plants^fnngi, and bacteria. This femi]^ of enzymes is important for fee detoxafic^^ . 
xcnobiolic epoxide con^iounds which are oftenUgh^ electrophilic and destrucdve when introduced 
into an organism. Examples of epoxide hydrolase reactions include the hydrolysis of 

20 ds-9,10-epoxyoctade&-9^)-enoic add (leukotoxin) to form its conesponding diol, 
fhreo-9,10-dihydro^/octad6C'12(2^mHc a<dd (leuloDtDxiiL diol), and die hydrolysis of 
d5-12,13-cpo^ctadec-9(Z)-dDK>ic add (isoknlsotoxin) to form its corresponding diol 
thr6o-1243-dihydro3cyoctadec-9Q!d-c(nDica^ IxaibttosDDsaltE^naKanbra^ 

pM-mpjihiKly niy^ jm trflngpnrt and canse ttiflgmTnata ty nefiponses. In addidon, qpoxide carcinogens 

25 are Imown to be produced by <^toctomBP450 as intermediates in the detoxifica^ 

einvironnQental toaans. 

The enzymes possess a catalytic triad corr^sed of Asp (the nuckophile). Asp (fee 

faistidine-suppordng add), and BBs (die watra--activadng tdsddine). The reaction medianism of eporidc 

hydrolase proceeds via a covalently bound ester intermediate initiated by fee nucleophilic attack of one 
30 of the Asp residues on fee primary carbon atom of the epoxide ring of fee target molecuk, leading to a 

coval^tly bound est^ intoiMdiate (Michael Arand, M. et aL (1996) J. BioL Chem. 271:4223-4229; 

Rink, R. ct aL (1997) J. BioL Chm. 272:14650-14657; Argiriadi, MA. et aL (2000) J. BioL Chem. 

275:15265-15270). 
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Catcchol-O-mctfavltransibrase : 

Catechol-O-me&jtoansfem^ (COMT) catalyzes the transfer of the methyl group of S- 
adenosyl-L-inethioiiine (AdoMet; SAM) donor to one of the hydroxyl groups of the catechol substrate 
(e.g., L-dopa, dopamine, or DBA). Methylation of flie 3 -hydro}Q^ group is fevered owr noB&j^tion 

5 of the 4 -hydioxyi group and the memhrane bound isofoim of COMT is naoic regiospecific Ihan the 
soluble form. Translation of the soluble form of the ^o^me results from utilization of an internal start 
codon m a fiiD-leaigdi niRNA (1 J M)) or fiom the ttansladon of a shorts mRNA (1.3 tb), transcribed 
ftom an internal promoter. The proposed Si^2-likeinethyMonitacdon requires Mg** 
byCa**. The binding of flie donor and substrate to COMT occurs sequentially. AdoMet first binds 

10 COMT in a Mg^^-indepondent manner, followed by flie binding of Mg** and (he binding of die catechol 
substrate. 

The amount of COMT in tissues is rcMvely high conqmred to the amount of activity normally 
required, thus inhibition is problematic. Nonetheless, inhibitors havebem developed for in vitro use 
(e.g., gallates, Hopolone, U-0521, and 3',4'-dihydro3^-2-mefli34-piDpiophetropolDne) and for clinical use 

IS (e.g.,nitrocatediol-hased compounds and tolcapone). Administralionofd^ese inhibitors results in die 
increased half-lifoofl^opa and the consequent foinia^ Mibitionof COMT is also 

likely to inciease diehatf-liie of various o&er catecholrstructure compounds, including but not limited 
to epinepluine/DoicpinBplitine, isopienaEne, rimiterol, dobutamme, f^ldopam, apomorpShine, and a- 
meth^ddopa. A deficiency in norepinei^uinehas been Imked to clmical depression, hence the use of 

20 COMT mhibitorscauM be usefim in the treatment of de^ COMT inhibitors are gpaieralty well 
tolerated vwlh minimal side effects and are ultimately metabolized in the liver with ohfy minor 
accumulation of metabolites in die body (MannistD, P.T. and KaalckDla, S. (1999) Pharmacological 
Reviews 51.593-628). 

25 The discoviay of new drug metabolizing enzymes, and the polynucleotides encoding them^ 

satisfies a need in the art by providing new conqw^tions which are usdul in the diagnosis, prevention, 
and treatment of ftiTt"tTnnniTici /mflanmia tDty, cell prolifeative, deyeloimiental, endoodne, eye, 
metabolic, and gastrointestinal disorders, including Hvct disord^, and in the assessment of die effects 
of exogenous conopoimds on the expression of nucleic acid and amino acid sequences of drug 

30 metabolizing enzymes. 

SUMMARY OF THE INVENTION 
The invention feataires purified polypeptides, drug mi^abolizing enzymes, ref^xed to 
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coDectively as "DME" and individually as *T)ME 1," 'VME-2r 'T>ME-4;' *T>ME-5," 

*T)ME"6," "DME-7r 'T>MB-8r "DME-10," 'DME-ll," 'TME^nr •T)ME-13/' "DME- 

14," *T)ME-15," •T)ME-16;' 'T)ME-17;' *T)ME-18," and *T)ME-19.'* In one aspect, the invention 
provides an isolated polypeptide selected firom the group consisting of a) a polypeptide comprising an 

5 amino acid sequence selected ftom the group consisting of SEQ ID NO:l-19 , b) a polypeptide 
comprising a naturally occurring amino add sequence at least 90% identical to an amino add 
sequence selected fiom the group consisting of SEQ ID NO:l-19 , c) a biologically active fragment of 
a polypeptide having an amino add sequence selected fiom the group consisting of SEQ ID NO:l-19 , 
and d) an imnmnog^c fragment of a po^jrpeptide having an amino add sequence selected from the 

10 group consisting of SEQ ID N0:l-19 • In one altemative, the invention provides an isolated 
pnHypRptiflft mTYipriRing tfie aminp add sequence of SEQ ID NO:l-19 . 

The invention fiudi^ provides an isolated polynucleotide encoding a pol$rpeptide selected ftom 
the group consisting of a) a polypeptide oonq>rising an amino add sequence selected from the group 
consisting of SEQ ID NO:l-19 » b) a polypeptide comprising a naturally occuning amino add 

15 sequence at kast 90% identicd to an amino add sequence selected fitimtiie group consi^ 
ID NO:l-19 , c) a biologicidly active fragment of a polypeptide havi^ 
selected from die group consisting of SEQ ID NO: 1-19 , and d) an inamunogcnic fragment of a 
polypeptide having an anodno add sequence selected from die group conidsting of SEQ ID NO:l-19 . 
In one alfccniative, flie polynucleotide encodes a polypeptide select^ 
' 20 IDN0:M9. In another dtetnative^ the polynucleotide is selected from the group consisti^ 
IDNO:20-38. 

Additional^, tiie invention provides a recombinant po]^ucleotide comprising a promoter 
sequence opCTabty linked to a potynudeotide csncoding a pol^ 

consisting of a) a potypqptide comprising an amino add sequence selected from the group consisting 
25 of SEQ ID NO:l-19 , b) a polypeptide conqjiising a naturally occurring amino acid sequence at least 
90% identical to an amino add sequence selected from the group consisting of SEQ ID N0:l-19 , c) a 
biological)^ active fragment of a polypeptide havmg an ammo add sequaice selected from the group 
consisting of SEQ ID NO:l-19 , and d) an immunogenic fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID N0:l-19 . In one alternative, the 
30 invention provides a cdl transformed with Ihe recombinant polynucleotide, hi another alternative, the 
invention provides a transge^c organism comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide selected from the group 
consisting of a) a polypeptide coniprismg an amino add sequence selected from the group consistiAg 
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of SEQ ID N0:l-19 , b) a polypeptide comprising a naturally occurring amino add sequence at least 
90% id^itical to an amino acid sequence selected fix>m the group consisting of SEQ ID N0:l-19 , c) a 
biologicafly active fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID N0:l-19 , and d) an immunogenic fragment of a polypeptide having an aminD 

5 add sequence selected from the group consisting of SEQ ED NO:l-19 . The method comprises a) 
culturing a cell under conditions suitable fr)r es^iression of the polypeptide, wherein said cell is 
transformed with a recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide encoding tiie polypeptide, and b) recovering the polypqptide so e35)r6ssed 

Additionally, fte invention provides an isolated antibody whidi spedfically binds to a 

10 polypeptide selected from dte group consisting of a) a polypeptide comprising an amino add sequ^ce 
sdected from tiie group consisting of SEQ ID NO:l-19 , b) a polypeptide comprising a naturally 
occoidng amino acid sequence at least 90% identical to an amino add sequence selected from die 
group consisting of SEQ ID N0:l-19 » c) a biologically active fragment of a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID N0:I-19 , and d) an immonog^c 

IS fragment of a po!^rp&ptide having an ammo add sequ^ice selected from &e group consisting of SEQ 
IDNO:l-19. ' 

. The invention further jirovides an isolated pofynucleotide selects 
a)apo]^imcl6otid6con[qmsingapolynadeatid6seq[uencesekc^ 

ID NO:20-38, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
20 90% id^cal to a pol^dBotide sequence selected from tiie group consisting of SEQ ID NO:20-38» 
c) a polynudeotide cotopl^Enentaiy to Ihe polynucleotide of a)» d) a polynudeotide complcmi^itary to 
1hepol^ck»tideofbXande)anBNAequ]vaAe^ In one altcamative.tiie polynucleotide 

comprises at least 60 contiguous nucleotides. 

Additional^, the invi^on provides a metiiod for detecting a target pol^cleotide in a sanq>H 
25 said tatgjet poi^pnacleotide having a sequence of a po^nucleotide selected from tiie group consistmg of 
a) a polynucleotide comprising a polynucleotide sequoice selected from the group consisting of SEQ 
ID NO:20-38, b) a polynucleotide conqnising a natural^ occurring polynucleotide sequence at bast 
90% identical to a polynuckotide sequence selected from the group consisting of SEQ ID NO:20-38, 
c) a polynucleotide compl^entaiy to the polynucleotide of a), d) a polynucleotide complementary to 
30 the pol^ucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) hybridizing flie 
sample witii a probe comprising at least 20 contiguous nucleotides comprising a sequence 
con^lementaiy to said target polynucleotide in tiie sampl&» and which probe specifically hybridizes to 
said taigjet polynudeotide, under conditions whereby ahybridization complex is formed between said 
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probe and said target polynucleotide or fragments thereof, and b) detecting the presence or absence of 
said hybridization complex, and optionally, if present, the amount thereof. In one alternative, the probe 
con]()rises at least 60 contiguous nucleotides. 

The invention farther provides a mefliod for detecting a target poljamcleotide m a sample, said 

5 target polynucleotide having a sequence of a polynucleotide selected from the group consisting of a) a 
polynucleotide comprising a po^mncleotide sequeiK» selected from the group consisting of SEQ ID 
NO:20-38, b) a polynucleotide comprising a naturally occurring potynucleotide seqaence at least 90% 
identical to a polynucfeotide seqaence selected from the group consisting of SEQ ID NO;20-38, c) a 
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to die 

10 polynucleotide of b), and e) an RNA equivalent of a)-d). The method compdses a) ampSfsnmig said 
target polynucleotide or fragment tfaeareof using polymerase chain reaction ampHfication, and b) 
detecting the presence or absence of said amplified tatg^ polynucleotide or fragment thereof, and, 
optionally, if present, tihe amount tiumof . 

The invention farthei: provides a con^iosition conqxnsing an effective amount of a pol^^peptide 

15 selected from the group consisting of a) a polypeptide comprising an amino add sequ^ice selected 
from tiie group consisting of SEQ ID NO:l-19 , b) a polyp^de oonq>rishig a naturally occuning 
amino acid sequence at least 90 % identical to . an amino add sequence selected from tiie group 
. conastingofSEQroNO:M9,c)abiologicdlyactivcfragmentofapolypeptideha 
add sequ^ice sdected fronltiie groiq> consisting of SEQ ID NO:l-19 , and d) an immoonogenic 

20 fragment of a polypeptkSe having an amino acid sequ^ice selected from flie group consisting of SEQ • 
ID N0:l-19 , and a pbaimaceutically acc^tabb exdpirait In one mbodSment, tihe composition 
con^Jiises an amino acid sequence sefccted from the group consistii^ The 
invention additionally provides a method of trcatii^ a disease or condition assodated with decreased 
expscssion of functional DME> oompiistng admimstedng to a patient in need of such treatment the 

25 conq)osition. 

The niv^ition abo provides a method for scteeniug a compound for effectiveness as an 
agonist of a polypeptide selected from flie group consisting of a) a polypeptide comprismg an ammo 
add sequence selected from the group consisting of SEQ ID NO:l-19 , b) a polypeptide comprismg a 
naturally occuning amino acid sequence at least 90% identical to an amino acid sequence selected 
30 from the group consisfcg of SEQ ID NO: H9 , c) a biologplcaliy active fragment of a polypeptide 
having an amino add sequence selected from the group consisting of SEQ ID NO: 1-19 , and d) an 
immunog^iic fragment of a polyp^jtide having an amino ttdd sequence selected from the group 
consisting of SEQ ID NO:l-19 . The nielhodcominises a) exposing a sample coni^risingth^ 
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polypeptide to a com^imd, and b) detecting agpnist activity in the sample. In one alternative, the 
invention provides a composition comprising an agonist compound identified by the method and a 
pharmaceutically acceptable excipient In anoliier alternative, the invention provides a method of 
treating a disease or condition associated with decreased expression of ftmctional DME, comprising 
5 administering to a pati^ in need of such treatment the composition. 

Additionally, &e invention provides a me&od for screening a compotmd for cfFcctivcness as 
an antagonist of a polypeptide selected firom the group consisting of a) a polypeptide comprisiiig an 
amino acid sequence selected bam the group consisting of SEQ ID NO:l-19 , b) a polypeptide 
comprising a naturally occurring anmo add sequence at least 90% identical to an amino add 
10 sequence sdccted from the group consistmg of SEQ ID N0:l-19 , c) abiologicalty active fragment of 
a polypeptide having an amino adki sequence selected from the , 
and d) aninmninogenic fragmieait of a polypeptide having an amino add seqnence selected from the 
group consistmg of SEQ ID NO:l-19 . The method conqnises a) exposing a sample comprising the 
polypeptide to a con^und, and b) detecting antagonist activity m the san^^ In one alt^tnative, the 
15 inv^on provides a composition comprising an ant^nist conqKnmd id^itified by fte method and a 
phannaceuticaltyacce^bleexcipieait In anofliCTalteamative, the invention provides a mefliod of 
treating a disease or condition assodated with overexpression of functional DME, oonqnising 
fldminis tgring to a patient in need of Sttch treatment the oompodtion. 

Tlic invention fiirther provides a mefliod of stealing for a compound that sped 
20 a polyp^de selected from the group consisting of a> a polypeptide con^risiiig an amino acid 
sequence selected from the group consistfaig of SEQ ID N0:l-19 , b) a polypeptide conq«tismg a 
naturally occunii^ amino add sequeiice at least 90% id^itical to an amino acid sequence selected 
from flkB group consislit^ of SEQ ID NO:l-19 , c) a biologically active fragment of a polypeptide 
having an amino add sequeance selected from the group consisting of SEQ ID N0:l-19 , and d) an 
25 immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 

consisting of SEQ ID NO: 1-19 . The method conqwises a) combining the polypeptide with at least one 
test compound under suitable conditions, and b) detecting binding of the polypeptide to tiie test 
compound, tihearby identifying a compound that specifically binds to the polypeptide. 

The invention ftirtha* provides a me&od of screening for a compound that modulates the 
30 activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
add sequence selected from tte group consisting of SEQ ID NO:l-19 , b) a polypeptide comprising a 
naturally occurring amiiK> add seqaooce at least 90% identical to an amino add sequence selected 
from die gronp consisting of SEQ ID NO:l-19 , c) a biologjcaliy active fragment of a polypeptide 
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having an aoiino add scquCTuce selected from the group consisting of SEQ ID N0:l-19 , and d) an 
ionnanogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-19 . The method comprises a) combining the polypeptide widi at least one 
test compound under conditions permissive for the activity of the polypeptide, b) assessmg the activity 
5 of Ae polypeptide in die presence of die test compound, and c) comparing the activity of flie 

polypeptide in the presence of the test conqiound with the activity of flie polypeptide in the absence of 
the test conq>ound, wheidn a change in the activity of die polypeptide in the presence of tiie test 
con^und is indicative of a compound fliat modulates the activity of tiie polypeptide. 

The invention fiuth^ provides a meOiod for screening a conxpound for effectiveness in 
10 altcnng e2^[»ession of a target polynackotide, wherem srid targpt polynnclcotide comprises a 
polynucleotide sequ^ice selected from tiie group conasting of SEQ ID NO20-38, the method 
comprising a) ea^joang a sssnp^ compriang the target polynucleotide to a conipound, and b) detecting 
altered expression of the target pol^^cleotide. 

The invention furflieir provides a method for assessing toxidty of a test conqmund, said 
15 metiiod conqirising a) treatmg a biological san^le containing nucldc adds with die test compound; b) 
hybridizing flie nncl^c adds of die treated biolog^ satxq>le wifli a probe comptising at least 20 
contiguous nudeotides of a po^ucleotide selected from the group consisting of i) a polynucleotide 
con^rismg a polynucleotide sequeaice selected from die group consisting of SEQ ID N02(>-38, ii) a 
polynucleotide comprising a naturally occuning polynucleotide seqaeoce at least 90% idaitical to a 
20 polynucleotide seipMcesefecled from die group 

having a sequeaice oon^l^nentary to i), iv) a pol^cleotide complementary to die polynucleotide of 
ii), and v) anRNA eqnivakut of i>iv)L Hybridization occurs undar conditions wtereby a specific 
hybridization con4)lex is formed between said probe and a target polynucleotide in die biological 
san^ile, said targpt polynucleotide selected from the group consisting of i) a polynucleotide comprishig 
25 a polynucleotide sequence selected from the group consisting of SEQ ID NO:20-3 

polynucleotide comprising a naturally occurring pol^cleotide sequence at least 90% identical to a 
polynucleotide sequ^ice selected from die group consisting of SEQ ID NO:20-38, iii) a polynucleotide 
complementary to the polynucleotide of i), iv) a polynucleotide complementary to the polynudeotide of 
ii), and v) an RNA equivalent of iHv). Alternatively, the target polyuucleotide comprises a fragment 
30 of a polynucleotide sequence selected from the group consisting of i)-v) above; c) quantifying the 

amount of hybridization con:q)lex; and d) comparing die amount of hybridization coirq>lex in die treated 
Mological sample wifti die anK>unt of hj^iridization complex in an untreated biological sample, wherdn 
a difrbtence in die amount of hybridization conoplex in the treated biolo^cal sample is imficative of 
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toxici^ of the test compound. 

BRIEF DESCRBPTION OF THE TABLES 
Table 1 summarizes the nomenclatuj» for the fim lengfli pol^^^ 

5 sequences of the present invention. 

Table 2 shows the GaiBank identification nmnber and anttotadon of the nearest GenBank 
homologforpolypeptides of the invention. Tbeprobabffityscoieforth^ 

polyp^de and its GenBank homolog is also shown. 

Table 3 shows stmctural features of polypeptide sequences of die invention, including 
10 predictedmotifeanddomauis,alongwithaieinethods,algoiilh^ 

analysis of fbe polj^peptides. 

Tifl>le 4 Bsls tbe dDNA and/or genomic DNA fiiJgni^ 
polymMOeotide sequences of fee invHDdion, jA^^ 
sequences. 

15 TAfe5diDV«theicpnKBnlatrved)NAlibnuvfor 

Table 6 proves an appendix whidi describes fbn tissaes and wtois nsed for coostaiclion of 

the cDNA fibmies shown in Table 5. 

Table 7 shows Ac tools, piogtams, and algoriflans used to analyze flic polynadeotidcs and 
polypeptides of the invention, along wifli applicable descr^itions, references, and flireshoH pananetBis. 

20 

DESCRIPnON OP THE INVENTION 
B^ieflie present proteins, mdeotide sequences, and metiiods are described, it is understood 
diatfUsinviaitionisnDtliniitBdtofliepaiticnlarinadto 
mayvaiy. it is idso to be understood tiiattijetetminotegy used b«adn is for fliep^ 
25 patticnlarenibodim<mtsonly.andiBnotintBndedtolimitftB^^ 
liefimited onii^bydiB iqppeatidBd claims. 

ft must be noted fliat as used hemn and in the appended clMms, tiie singular fDims "a," "an." 
and "flie" inctade plural reference unless flie context dearly dictates oflierwise. Thus, for example, a 
tefi»enoe to "a host ceff ' includes a plurality of such host ceDs, and a reference to "an antajody" is a 
30 rcferaice.to one ormpre antibocties and equivalajis hereof known to tiiose skilled in the art. and so 
forfli. 

IMess defined otherwise, all technical and sciaitific terms used heardn have the same 
meanmg^ as coramanfyund^stoodby one of oidinaiy skill in Ae art to whidithisinvcjitionbelongs. 
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Atthough any machines, materials, and methods similar or equivalent to those described herein can be 
nsed to practice or test the present mvention, die preferred machines, materials and taethods are now 
desc^ed. AH publications mentioned herein are dted for the purpose of describing and disclosing flie 
cell Hnes, protocols, reagents and vectors whidb are reported in iho publications and which migjit be 
5 used in connection with the mvention. Nothmg herein is to be constraed as an admission that the 
invention is not eaititled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

'T)ME" refers to the ainino add sequMioes of substantiafly purified DME obtained fro 
spedes, particularly a mammalian spedes, including bovine, ovine, pordne, murine, equine, and human, 

10 and from any source, whether natural, synthetic, semi-synthetic, or recombinant 

The term "agonisr lefas to a molecule whjk^ intensifies or mimics the biological activity of 
DMR Agonists may include protdns, nucldc adds, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of DME either by direcdy interacting with 
DME or by acting on components of the biological pathway in whidi DME particqmtes. 

15 An"aIldicvariaiit'*isandternativeformoftiiegeneencodi^ Allelic variants may 

icsult fiom at least one mutation in tiie nucleic add sequraco and may result in altered mRNAs or in 
polypeptides whose structure or fimction may or inay not be a^ A gene may have none, one, or 
many alfclic variants ofilsnatuirfty occurring forni. Common mutational dianges^di give rise to 
allelic variants aie geni^:ally ascribed to natural deletions, additions, or substitutions of nucleotides. 

20 Eadi of tiiese types of changes may occur alone, or in combination wifli tiie oth^, one or more times 
in a given sequence. 

"Alteted" nucleic add sequences encoding DME include fliose sequences wifli deletions, 
insertions, or substitutions of different nucleotides, resuming in a polypeptide the same as DME ox a 
polypeptide witii at least one functional charact^istic of Dl^ Included within this definition are 

25 polymorphisros whidi may or may not be readily detectable using a particular oligonucleotide probe of 
flie polynucleotide enoodmg DME, and improper or unexpected hybridization to allelic variants, with a 
locus ofliar than tiie normal dm)mosomal locus for die polynucleotide sequence Tte 
CTcoded protein may also be "alteaed," and may contam deletions, msertions, or substitutions of amino 
add residues whidi produce a silent diange and result m a functionally equivabnt DME. Deliberate 

30 amino add substitutions may be made on the basis of similarity in polarity, diarge, solubility, 

hydrD0iobidty, hydiophilidty, and/or the amphipathic nature of the residues, as long as flie biological 
or immunological activity of DME is retained. Bor example, negatively charged amino adds may 
include aspartic add and glutamic add, and positively diarged amino adds may include lysine and 
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10 



15 



aremine. Amino adds with imcharged polar side chains haAmgsiinflar^^ 

inclnde: asparagine and ghitamine; and serine and lJueonine. Amino acids wifli uncharged side chains 
having similarhydiophilid^ values may include: leucine, isoleudne. and valine; glycine and alanine; 

and phenylalanine and tyrosine. 

The terms "amino add" and '*anuno acid sequence'' refer to an oligopeptide, pejrtide, 

polypeptide, or protein sequence, or a fragment of any of liiese. and to naturally occuning or synftetic 
molccuks. Where "amfajo add sequence" is redted to reftar to a seqaaice of a naturalbr occuning 
protein molecule, "amino acid sequence" and Hke tenns are not meant to limit die amino add sequence 
to die complete native amino add sequence assodated wifli flw tedted protdn molecuk. 

"Amplification" relales to flie ptoduc^on of additional cojaes of a nncfcac add sequence. 
AmpBfication is generally carried ontnsing polymerase diainieaction (PGR) tedmologies welllmown 
indieatt 

Hk term "antagpnist" refers to a molecule wMdi inMhits or attenuates flie biolDgiad acfcnty 
of DMB. Antagpnists may inclnde pioteiiB sudi as anybodies, nncldc adds, carbolq^drates, small 
mokcales, or any other conqtonnd or cmnposiiion vban^ modulates die activi^ of DME dflier by 
direcdy inhaacting with DMB or by acting on conqronenls of a» biologiod paflway in wMdi DME 



The tMm "antibodjr lefers to mtact irmnuno^obulin inolecifles as wen as to fi^^ 
ibenof, sudi as Fab, F(ab'>„ and Pv fragnaenls, whidi are capable of bmding an epitopic deteiminant 
20 Anfflwdics Aat bind DME poljrpqptidescanbepBparcduan^ 

^^an.«.g ,m«.ll rgrfides of iBtaest as d» imrooniang antigen. Hie polypeptide or oligopeptide used 
to immunize an annual (e.g.,amouse.arat,ota rabbit) can be derived from the transladon of RNA, 
or symbesized diemicany, and canbe conjugated to a carriw piotdn if desired. Commonly used 
caitieisfliat«»dManicdfyccrapledtop^»tidesinctadebownese^ thyroglobulin, and 

25 kejliDlBliDapcthemocyaninCKLH). Ihe coupled peptide is then used to immunize flie animal 

The term "antigenic deteaminanr refers to fliat regjon of a molecule (i.e., an epitope) fliat 
makes contact wifli a particular antibody. When a protein or a fragmait of a protdn is used to 
: immunize ahost animal, numMous regions of die protdn may induce flie production of antibodies 

whidibind specificaDy to antigenic determinants (particular regions or fliree-dimcaisional structures on 
30 the protein). An mtigenic determnant may compete wifli flie intact antigai (i.e.. die immunogenused 
to elidt the immune response) for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing wifli flie "sense" 

(coding) strand of a spedficnucldc add sequCTce. Antisense conqwsitions may indudeDNA; RNA; 
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peptide nucleic add (PNA); oligomcleotides having modified backbone linkages such as 
phosphoiothioates^ methylpbosphonates, or b^izy^pliosphoiiates; oligonucleotides having modified 
sugar groups such as 2 -methoxyefliyl sugars or 2 -mefhoxycthoxy sugars; or oligonucleotides having 
modified bases sudi as 5-methyl cyfeosine, 2'-deoxyuracil, or 7-deaza-2 -deoxygaanosme. Antisense 

5 molecules inay be produced by any method including dieinicalsynflieMs or liansc^ Once 
introduced into a cell, the complementary antisenso molecule base-pairs with a natoralfy occuning 
nucleic add sequence produced by the cell to fonn duplexes whidi block dther transcdption or 
trauslatioiL The designation "negative" or •*miiiu5'* can refer to the antiseuse strand, and flie 
designation **positive'* or **plus" can lefer to Ihe sense strand of a reference DNA molecule. 

10 The tenn 'Tiiologically actiro" refers to a protein hamg stractoral, regulatory, or biochemical 

fnnctbns of a naturally occuning molecule. Likewise^ '"immuDologicalty actiYe** or "immunog^c" 
refers to flic capabflity of the natural, rccornhmant, or synflietic DME , or of any oligopeptide flicxeof , 
to induce a specific immune response in s^ipropriate anima]s or cells and to bind wifti specific 
antibodies. 

15 'tjon]^len»^taiy^ describes flietelation&hi^ 

sequences lhat annciQ by base-pauing. Ibr example, S'-AGT-3' pairs with its coinplement, 

. A "composition comprismg a gjven pol^ucleotide sequeiK»** and a "compodtion comprisbg a 
g^ven amino add sequence" refer broadly to any composition contaming fee given polynucleotide or 

20 amino add sequence, Ibe conqK>sit]on may comprise a dry fonnulation or m 

Compositions comprising polynucleotide sequ^ices oicoding DMB or fragments of DME may be 
enq^yed as hybridization probes. Tie probes inay be stored in frerae-dried form and may be 
assodated with a stabSizhig agent such as a carbohydrate. Li hybridizations, tiie probe maybe 
depfoyed in an aqueous solution containing salts (e.g., NaQ), detergents (e.g., sodium dodecyl sulfate; 

25 SDS), and ofeer con^nents (e.g., Dcnhardf s solution, dry milk, sahnon sperm DNA, etc.). 

'Xlonsensns sequence" refers to a nucldc add sequence whidihas been subjected to 
repeated DNA sequence analysis to resolve uncalled bases, extended usmg tbe XL-PCR kit (Applied 
Biosyst^ns, Foster CA) in tiie 5' and/or the 3* direction, and resequenced, or whidi has been 
assembled from one or more overlying cDNA, EST, or genomic DNA fragments using a compter 

30 program for fragment assembly, such as the GELVIEW fragment assembly system (GCG, Madison 
WT) or Phr^ (University of Wadiington, Seattle WA). Some sequences have been both extended 
and ass^nbled to produce Ihe cons^isus sequence. 

'X^onservative amino add substitutions'* are those substitutions tiiat are predicted to least 
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mterfere with flie properties of fte original protean, i.e., the stmcture and especially the function of the 
protein is conserved and not sigmficantly changed by such substitutions. The table below shows amino 
adds which may be substituted Ibr an original amino acid in a protdn and whidi are regarded as 
conservative amino acid substitutions. 



5 


Qrigmal Residue 


Conservative Substitution 




Ala 


Gly.Ser 




Arg 


His, Lys 




Asn 
Asp 


Asp, Gin, His 
Asn, Gia 


10 


Cys 


Ala,Ser 




Gin 


Asn,Ghi,Hi5 




Ghi 


Asp, Gin, His 




Gty 


Ala 




Hs 


Asn, Arg, On, Gb 


15 


Ec 


Leu, Val 




Leu 


Be, Val 




Lys 


Atg,Gln,Oa 




Met 


Leu, He 




Phe 


His, Met, Leu, Trp, Tyr 


20 


Sc^ 


Cys, Thr 




Thr 


Ser,Val 




Trp 


Phe,Tyr 




Tyr 


His, Phe, Trp 




Val 


IlB,Leu,Thr 



25 



Conservative amino add substitutions g^ierally noaintain (a) the structure of the polypeptide 
backbone in fiie area of tiie substitution, for exatuide, as a beta sheet or alpha helical conformation, 
(b) tiie diarge or bydrophobidty of tiie molecub at tiie At& of the substitution, and/or (c) die bulk of 
the side chain. 

30 A "deletion" r^eas to a diange in tiie amino add or nucleotide sequeaDce tiiat results in tiie 

absence of one or more amino add residues or nucleotides. 

The t^m '^dedvatrve" refers to a diemically modified po^nacleotide or polypeptide. 

CbsDoical modificatiDns of a polynucleotide can include, for tawsxph, replaceam^ of bydrog^ by an 

aIkyiac^h)ndiox)4, or amino groiip. A derivative polymicleotide encodes a polypefytidewMdi retains 
35 at least one biological or inomunological function of the natural molecule. A derivative polypeptide is 

one modified by glycosylalion, p^lation, or any dnular process that retains at least one biological or 

iiomtaiobj^cal of the poljrpeptide from \\^di it was derived. 

A "detectable label** reieiE to a reports molecule or enzyme that is capable of generating a 

measurable signal and is covalently or noncovalentty joined to a polynucleotide or po^^eptide. 
40 Difife^ntial expresdoir refers to increased or upregqlated; or decreased, downtegnlated, or 



35 



wo 02/12467 



PCT/USOl/24382 



abs^t gpne or protdn egression, dttonriiied by comparing at least two different samples. Sndi 
comparisons maybe carried out between, for example, a treated and an untreated sample, or a 
diseased and a nonnal sample. 

"Exon shuffling*' refers to the recombination of difieient coding regions (exons). Since an 

5 exontoay represent a stnicturd or functional donoamoftho encoded protein, 

assembled through flie novel reassortnaent of stabfc substructures, thus altowing acceleration of the 
evolution of new protein fnnctions. 

A •'fiagmenT is a unique portion of DME or fee polynucleotide encoding DMB whidi is 
identicd in sequeiiyce to but shorter in lengft than th^ A ftagment may comprise up 

10 to Ihe entile lengfli of flie defined sequence, minus one nuckofide/anDdno add residue. For caanq>lc, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or ammo add r^idues. A fragment 
used as a probe, primes, antigen, Aefapeutic molecule, or for other purposes, maybe at least 5, 10, 15, 
16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino add 
residues in length. Eragments may iK^pieferCTtiaDy selected from certain regions For 

15 exan^le, a polypeptide fragment may comprise a certain leaigdi of contiguous amino adds selected 
from the fibrst 250 or 500 ammo adds (or first 25% or 50%) of apolypqitide as shown m a certain 
defined sequeaice. Qeartyflieselengflis are exemplary, arid any lengfli that is supported by the 
spedfication, hicluding te Sequence listing, trfjles, and figures, may be en^ 
erribodiments. 

20 Afragment of SEQroNO:20-38 compisesarBgion of unique pol^ 

spedficalty identifies SEQ ID Na20-38, for example, as distinct from any oflier sequ^ce m the 
g^ome from wHdi Ihe fragn^nt was obtained. Afragm^tof SEQIDNO:2a-38isusefdl,for 
exan^le, in hybridization and amplication tcdmologies and in analogous methods that distinguish SEQ 
ID NO:20-38 from related polynucleotide sequences. The precise length of a fragm^ of SEQ ID 

25 NO:20-38 and the region of SEQ ID NO20-38 to whidi the fragment corresponds are routinely 
dete^tminable by one of ordmary skill in the art b ased on die intended purpose for the fragment 

A fragment of SEQ ID NO:M9 isencodedby afragmentof SEQIDNO:20-38. A 
fragment of SEQ ID NO: 1-19 comprises a re^on of unique amino add sequence fliat specificafly 
identifies SEQ ID NO:l-19 . For example, a fragment of SEQ ID NO:l-19 is useful as an 

30 immunogenic peptide for the development of antibodies that specifically recognize SEQ ID N0:l'-19 . 
The precise lengfli of a fi^gment of SEQ 1DN0:1-19 and tiie region of SEQ ID N0:l-19 to which 
flie fragment conesponds are routine^ determinable by one of ordinary sldll in the art based on fiie 
intended purpose for flie fragment 
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A 'Iftifl kmgth^ polynucleotide seqae»ce is one contaimng at least a translation initiation codon 
(e.g.»]]K:!ddoiiine) followed by an op^ieadiiiigfir^ A*%Q 
length'' pol^ucleotide sequence encodes a '^II lengtii" polypeptide sequence. 

^"Homolog/' relets to sequence shnilaii^ or, interchangeably^ sequence identity, betweeai two 
5 or inoic polynucleotide sequMices or two ox inore polypeptide sequences. 

The terms **peiccnt identity^' and identity," as applied to polynucleotide sequ^ces, refo: to 
the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algoiidmL Such an algoiidun may insert, in a standardized md reproducible way, gaps in 
the sequences being compared in order to optinnze alignment between two sequences, and therefore 
10 achieve a more meaningful comparison of tihe two sequences. 

Percent identity betwe^ po^ucleotide sequences may be determined using tfie definilt 
parameters of the CXXJSTAL V algorithm as incorporated into the MBGALJGN version 3. 12e 
sequence alignmc^ program. This program is part of the LASERCIBNE software package, a suite of 
mokcular biological analysis }HX)granis (DNASTAK, Madison WI). CLUSTAL V is described in 
IS BBggins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and m H^s, D.G. et aL (1992) CABIOS 
8:189-191. For pairwise afigom^its of poljmcleotide sequences. Hie default parameters are set as 
folbws:i&opl6=2,^p@D;a]ty^5, wiDdow=4 The 'Vdghted"^ residue 

weight table is selected as file de&nk. Percent identity is i^rted by CLUSTAL V as the ^rcent 
similanty between aligned polynucleotide sequences, 
20 Alternatively, a suite of conimonfy used and fieelyavailabksec^ 

is provided by &e National Center for Biotechnotogy Infonnalian (NCBI) Bade Local Alignment 
Search Tool 03LAST) (Altsdml, S.R et aL (1990) L MoL BioL 215:403-410X wMdi is available fiom 
sevi^ sources^ including the NCBI« Bethesda, MD, and on the Inteniet at 

bttp'/A»ww ticbi Tilm fiih gnv/RT The BLAST software suite includes yarions sequence analysis 

25 programs includmg *1>lastn/' that is used to align a known polynucleotide sequence with other 
polynucleotide sequences fiom a variety of databases. Also available is a tool called "BLAST 2 
Sequences" diat is used for direct pairwise con^Yarison of two nudeolide sequences. *'BLAST 2 
Sequences" can be accessed and used interactively at htlp-y/www.ncbi>n1m-nih.gov/gorM>12JitniL The 
''BLAST 2 Sequences** tool can be used for both blastn and blastp (discussed below). BLAST 
30 programs are commonly used with gap and other parameters set to default settit^. For example, to 
compare two nucleotide sequences, one may use blastn with die **BLAST 2 Sequences'* tool Version 
2.0.12 (Apnl-21'2000) set at default paramet^. Sudi dei^iult parameters may be, for example: 
Matrix: BWSUM62 
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Reward for makh: 1 
Penalty for mismatch: -2 
Open Gap: 5 and Extension Gap: 2 penalties 
Gap X drop-off: 50 
5 Expect: 10 

Word Size: 12 
Filter: on 

PerceDt identic maybe measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for exan^k, 
10 over the lengfli of a fragment taki^ from a larger, defined sequence, for instance, a fragment of at 
least 20» at least 30, at least 40, at least 50, at least 70, at least lOQ, or at least 200 contigoous 
nucleotides. Sacfal^igllisaieexi»i^laryon^,anditisimdei5toodft^ 

by the sequ^ioes shown b^rdbii, in the taSjles, figures, or Sequence listing, may be used to desoibe a 
lengtii o vi^ whicb perc^itag^ identic may be measured. 

15 NqcMc add seqaeeuces that do not show a high degree of identity may 

similar amino add sequences due to tibodegma^^ Itismideistood&atdiaiiges 
in a nucldc add sequence can be made using Ifais degenmkcy to produce multiple nucleic add 
sequences tiiat an encode substantially ^ same protdn. 

Tbe phrases ''peicent id^titity^' and identity,'' as applied to pol^ptide sequences, refer to 

20 the p^eentagp of residne matdics between at least two polypeptide sequoices aligned using a 

standardized atgontimt Mediodsof po^/p^tidesequ«icealignaijrad;areweB-known. SomealigDm^t 
metiiods take into account conse£ryative ammo add substitutions. Sudi conservative substitutions, 
e}q)laiaed in more detail above, g^oeraUy preserve tiie charge andjiydrophobidty at die site of 
substitution, thus preserving the strocture (and therefore function) of the polypeptide. 

25 Percent identity between polypeptide sequences may be determined nsing the default 

parameters of die CLUSTAL V algorithm as incorporated into tiie MEGAIIGN version 3,12e 
sequence aUgnm^ pmgram (described and referenced above). For pairwise alignments of 
polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktnple=l, g^p 
penalty=3, window=5» and "diagonals saved"=5. The PAM250 matrix is selected as the default 

30 residue wei^t table. As with polynucleotide afignments, the percent identity is reported by 
CLUSTAL V as flie "percent similarity^ between aligned polypeptide sequ^ce pairs. 

Altemativdiy file NCBI BLAST software suite xnay be nsed. For example, for a pairwise 
con^mrison of two polype^de sequ^ces, one may use die '^LAST 2 Sequences" tool Vdsion 
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2.0.12 (AiMril-21-2000) wthbtas^ set at defaolt paramebas. Sudi dcfeuk parametMs maybe, for 



Matrix: BWSUM62 

Open Gup: 11 and EjOertsion Gap: 1 penalties 
S Gflp X drop-off: SO 

Expect: 10 
Word Size: 3 
FUter: on 

Percent ideaitity may be measuied otot tihe fcngfli of an entire defined polypeptide sequence, 
10 for example, as defined by a pardcnlar SEQ ID number, or may be measured over a sbottsx length, 
for example, over fbe iMgfli of a fiagment takoi from a larg^, defined polypeptide sequence , for 
instance, a fiagment of atleast 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
ISOcontigiioTistesidaeB. Sacfa]eajgtoai»ex^plaiyonty,andhisundeistoodthatanyfi^ 
fcngfli supported by fee seqiieaices dH>wnhe^ 
15 used to describe a lengfli over vMch p^c^itage identity may be measured. 

•Wintiaa artificial dteomDsomes** QiACs) are bnear microchromosomes wbidi may contain 
DNA sequences of about 61btol0Mbinsi2eand wiich contain all of the elements required for 
chmmosome teplLcalion, segregation and maintenance. 

The term "hnmanized antibody^ refers to an antibody molecule in which the ammo acid 
20 sequence in ibe non-antigen binding regions has been altered so tiiat the antibody more closely 
lesOTbles ahuman antibody, and still retains its original binding ability. 

'Hjbiidization'' refers to the process by vMch a polynu ckotide strand anneals witii a 
complementary strarri tiiroughbase paiiiog under defined hybridization conditions. Specific 
hybridization is an indication tiiat two nucleic acid sequences share a high degree of con^lementarity. 
25 Specific hybridization complexes ftmn under permissive annealing coiwfitions and remam hybridized 
after die •*wafihing"stc^s). Ihe washing step(s) is paiticularly important in detenmnh^ 
string^cy of die hybridization process, with more stringcait conditioi^ 

bindinga.e., binding between painjofnncleicacidstrand^ Permissive 
conditions for annealing of nucHc add sequences are routinely detMmmableby one of ordinary skill in 
30 the art and maybe consistent among hybridization experiments, whereas wash comfitions maybe 
varied among experiments to acaibvefcedeaw^ 
Permissive anneJdfaig conditions occur, for exan^l^^ 
1 % (w/v) SDS y and about 100 figted Cleared, denatured salmon sperm DNA. 



39 



wo 02/12467 



PCT/USOl/24382 



G^ierally, stringency of hybridization is expressed, in part, with reference to the teanperature 
under which the wash step is carried out Such wash temperatures are typically selected to he ahout 
5"C to 2(fC lower than ttie thennal melting point (T^ for the specific sequence at a defined ionic 
strength and pK The T^^ is the temperature (under defined ionic str^gth and pH) at which 50% of 
5 the target sequence hybridizes to a perfectly matched probe. An equation for calculating T„ and 

conditions for nucleic add hybridization arc well Imown and can be found in Sambrook, J. ct al. (1989) 
Molecular goning: A Laboratory MannaL 2^ ed., vol 1-3, Cold Spring Harbor Press, Plamview NY; 
specifically see volume 2, chapto- 9. 

High stringency conditions for hybridization between polynucleotides of flie present inviraition 
10 include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour. 
Alternativefy,tempeffatniesofabout65^60**a55*<:^or42'€ SSCconcCTtoationmay 
be varied fiom about 0.1 to 2 x SSC, wifli SDS being present at about 0. 1%. Typcally, blocking 
reag^ are used to block non-specific hybridization. Such blocking reagents include^ for instance, 
dbeared and d^iatai^d salmon spemDNA at about 100-200 |tg^ Organic sohrwit, sudi as 
15 fonnamidc at a concentration of about 35-50% v/v, may also be used undear particular duEcumstances. 
such as for RNA--DNA hybridizations. Usrfnl vaiiatkms on tiiese wash conditions will be readily 
parent to those of ordinary skin in tiie art Hybridizarion^particularlfy underbid stringency 
conditions, rnaybesng^stiveofevolnti0naiysinQilaritybetweMi fee Suchsimilarilyis 
strongly indicattve of a similar role for the nucleotides and their encoded polypeptides. 
20 The term "hybridization cample]^ refes to a complex formed between two imcleic add 

sequences by virtue of the formation of hydrogen bonds between complaneiitary bases. A 
hybrid£z;ation coinplex may be formed in solution (e.g. , C^Jt or R,jt analysis) or formed betwe^ one 
nucleic acid sequence preseart in solution and anotii^ nucldc add sequence inunobilized on a solid 
support (e.g.. pap«-, membranes, filters, chips, pins or glass slides, or any oflicr appropriate substrate 
25 to which cells or their imckic adds have been fixed). 

Tlie words *lnsartion** and "addition'* refer to diangps in an ami^ 
sequence resulting in the addition of one or rnore amino aci 

"hmnune response" can refer to conditions assodated with mflarnniation, trauina, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
30 of various factors, e.g., cytokines, chemokines, and other signaling molecules, wMdi may affect 
cellnlar and systemic defense systems. 

An "imrnunog^c fragment" is a po^peptide or oligopeptide fi^gment of DME whidi is 
csq>able of eliciting an immune response when introduced into a living organism, for exanqple, a 
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mammal The tcnn "immimogenic fragment" also inclodcs any potypeplide or oligopeptide fragment of 
DME wMch is useful in any of tbe antibody production methods disclosed herein or known in the art 

The term •*microairay" refers to an arrangmient of a plurality of polynucleotides, 
polypqrtides, or other chemical compoui^ on a substrate. 
5 The terms "elemcnf and "array elem^' refer to a polynucleotide, polypeptide, or oflier 

diMnical compound having a unique and defined position on a microarray. 

Ths term '^modulate'" refers to a diangp in the aclmty of DMR I^^ 
may cause an increase or a decrease in pioldn activity, binding characteiistics, or any oflier biological, 
ftmctioiial, or inmiunolo^cid properties of DMR 
10 The phrases '^nucleic acid*' and •*iHicleic add sequoice'* refer to a nucleotide^ oligDnudeotide, 

polynndbotide, or any fiagment thereof. These phrases also lefea: to DNA or iWA of genomic or 
synthetic origin which may be smgle-stranded or donbk-stianded and may repress flie sense or the 
autisense strand, to peptide nucleic add ffNA), or to anyDNA-fike or RNA-like material 

"Opcrabljr Mnked** icfeas to flie dtuation in ^ch a first nadeic add seqacaKse is placed in a 
15 fdncdonal relalionshqj with a second nttdeic add sequence. Rwr instance, a piomoter is operably 
linked to a coding seqa^ice if the promoter a£fect5 the transciqplion or e^qnession of flie coding 
sequence. Opeiri>tyliiikedDNA sequences niaiy be in dose proximity or cor^ 
necessary to join two piotdn coding regions, in die same reading frame. 

'Teptide nucldc adtf* (PNA) ref^ to an antisraise molecule or anti-gisne agpnt which 
20 coii]{Rrises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
aiiiino add reddnes ending in lysine. The terininal lysnie confers solabilft^ PNAs 
pieferentiaDt/ bind complementary smgle stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to extend ihdr lifespan in die cell 

Tost-tnmslational modification'' of an DME may involve Hpidation, glycosyiation, 
25 phosphorjdation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
art These processes may occur syntiheticany or biodiOTiically. Biochemical modifications will vary 
by cell type depending on the enzymatic milieu of DME. 

*Trobe" refers to nucleic acid sequences encodmg DME, tiieir complem^ts, or fragments 
thereof, whidi are used to detect identical, aDelic or related nucleic add sequences. Probes are 
30 isolated oligonucleotides or polynucleotides attadied to a detectable label or reporter molecule. 

Typkal labek inchide radioactive isotopes, ligands, diemfluminescent agents, and e^izymes. 'Trimers** 
are short nucldc adds, usually DNA oligonucleotides, whidi may be annealed to a target 
polynucleotide by coiK4>kaDQent^ Theprimermay then be extended ak)ng the target 
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DNA strand hy a DNA polymerase enzyme. Primer pairs can be used for ampliiicatioii (and 
identification) of a Bucleic add sequence, e.g., by the polymerase chain xeacdoa (PGR). 

Probes and primers as used in tbe present invention typically comprise at least 15 contiguoos 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primors may also 
5 be emptoyed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these exan[q>les, and it is understood that any length supported by ^e 
spedfication, incbding the td>les, figures, and Sequence listing, may be used. 

Methods for preparing and using probes and piimeis are described in the references, for 

10 example Sambrook, J. et aL (1989) Molecular Clouing: A Laboratory MannaL 2"* ed., vol 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, RM. et aL (1987) Curr^ Pmtocols in Molecular 
Biology. Greene PabL Assoc, & wa&y-iQtersdences, New York NY; Innis, M. et aL (1990) PCR 
Protocols. A Gnide to Methods and Applications , Academic Bress. San Diego CA, FCR primer pairs 
can be derived finm a known sequence^ for exaniple, by using computer programs intended for that 

15 purpose sudi as Frimer (Vmion 0^, 1991» WUtdiead Institute for Biomedical Reseatdi, Cambridge 
MA). 

OligonndeotLdcsforuseasprimars are selected using software known in the art for sudi 
purpose. For exampH OUGO 4.06 software is useful for the selection of FCR primer pairs of up to 
100 nucleotides each, and for the analysis of ofigormcleotides and laiger polynucleotides of up to 5,000 

20 nncleotjdesfromaninputpolymicleotidesequenceofupto32ki]^^ Sinular primer selection 
programs have incorporated additional foatores for expanded ciqpabifities. For ^sample, the PrimOU 
primer sefection program (available to llie pubfic from die (Seaome Cesiter at iMverdty of Texas 
Soutii West Medical Center^ Dallas TX) is capable of dioosing specific primers fipom megabase 
sequotKresaod is diususefol for designing prinoeis on a gp^ The Frinai^3 primer 

25 selection program (available to ^epubHc from the WMteheadlnstitate^MTT Ce^^ 

Reseatdb, Cunibridge MA) allows the user to input a "^mispriming hT)rary," in which seqaeiKxs to 
avoid as primer bindiog sites are user-specified. Primer3 is usefol, in particular, for the selection of 
oligonucleotides for microarrays. (The source code for the latter two primer selection programs may 
also be obtained firom thdr respective sources and modified to meet the user's specific needs.) The 

30 PrinieGien program (available to fliepubhc from t^ 

Centre, Cambridge UK) designs primers based on multQ>le sequence aHgnments, theareby aUowing 
selection of primers that hybridize to either the most conserved or least consCTved regions of aligtted 
nucl^ add sequences. Hence, tins program is usefol for identification of both UMfpie and cons^ved 
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oligoinickotides and polymicleotide fragments. The oHgoraicleotides and polynucleotide fragments 
identified by any of the above selection methods aie useful m hybridization technologies, for example, 
as PGR or sequencing primers, microarray elements, or specific probes to identify fully or partially 
complementary polynucleotides in a sample of nucleic adds. Methods of oligonucleotide selection are 
5 not Umited to those described above. 

A "recombinant nucleic add" is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two or more otiierwise separated segments of sequence. 
This artificial combination is often accomplished by diemical synlhesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic adds, e.g., by genetic engine^nng tedmiques 
10 sudi as those described in Samibiook,5n^ The teimrecombiDant includes nucleic adds that have 
been aheied soMy by addition, substitution, or deletion of a portion of die nucldc add. Frequent!/, a 
recombinant nndeic add may include a nucleic add sequence operably linked to a promote sequence. 
Such a recombinant nucleic add may be part of a vector that h used, for example, to transform a cell 
AtomativB^!^, snc^ p^m^^jit^if"* micldc adds m^be part of a viral vector, e.g., based on a 
15 vacdma virus, tiiat could be use to vacdnato a mammal wherein tiic rec ombi nant nucldc add is 
eiqiressed, indudng a protective immunological response in the m am m al. 

A **regplatDry elements rders to a nuddc add sequ^ice usu ally derived from untranslated 
regions of a gene and includes enhanoets, promote, introns, and S* and 3* untranslated regions 
(UTE^). Regulatory el^ni^its intraract widi host or viral proteins whidi control transcription, 
20 translation, or RNA stability. 

*%po]ter motecnles** are diemical or biodiemical moieties used for labeling a nuddc add, 
amino add, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
f^j yattihunme scgntj or chromog&nic agraits; substrates: cofactors: inhibitois; magnetic particles; and 
other moieties known in the art 
25 An "RNA equivalmt,** in reieraace to a DNA sequence, is composed of tiie same linear 

sequence of iracfeotides as die relCTence DKA sequence with the exception tiiat afl 
the nitrogenous base thymine ate r^laced with uracil, and the sug^ir backbone is composed of ribose 
c instead of deoxyribose. 

The term "sanqile" is used in its broadest scaise. A sample suspected of containing DME, 
30 nucleic acids encoding DME, or fragments thereof may comprise a bodily fluid; an extract from a cell, 
chromosome, organelle, or membrane isolated fi^m a cell; a cell; genomic DNA, RNA, or cDNA, m 
solution or bound to a substrate; a tissue; a tissue print; etc. 

Theterms ''spedficbindu^ and "spedficalbf binduig'' refer to thatintoractionbetweeaaa 
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protein or peptide and an agonist, an antibody, an antagonist, a small nioIeci]le» or any natural or 
synthetic binding composition. The interaction is dependent upon the pres^ice of a particular stractorc 
of the protein, e.g., the antigenic determinant or epitope, recognized by die binding molecule. For 
example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the 

5 ^tope A, or tiie pres^ce of free unlabeled A, in a reaction containing free labeled A and the 
antibody win reduce the amonnt of labeled A diat binds to die antibody. 

Ilie t^m ''sabstantiall^ purified'' refi^ to mideic ad 
removed from fiieir natural environm^ and ate isolated or separated, and are at least 60% free, 
preferably at least 75 % fiee, and most preferably at least 90% free from ofh^ components wifli 

10 whidi they are naturaDy associated. 

A ''subslitation'' refers to the replacment of one or more amino add residues or nucleotides 
by diffearent aniino add residues or nucleotides, respective]^. 

''Substrate'' refers to any suitable rigid or s^m-rigid sup^ 
dnps, slides, wafers, fibers, magn^c or nonmagnetic beads, gels, tubing, plates, polymers, 

IS- mictoparticles and capillaiies. lliesubstratecanhavcavarietyof surface forms, sudi as wells, 
trendies, pins, diannds and pores, to wbidi polynucleotides or polypeptides are bound. 

A "transfsiqit image** refers to die collective pattern of geaie expression by a particular cell 
type or tissue nnder given conditions at a gjmn time. 

nrransformation'' describes a process by wbich exogenous DNA is introduced into a redpi^ 

20 ceE Transformationmayoccurundernaturalor artificial conditions according to van 

wen known in the art, and may rely.on any known method for the insertion of foreign nucleic add 
sequences into a prokaryotic or eubuyotic host cell The method for transformation is selected based 
on the type of hoist cell being transformed and may itK^lude, but is not limited to, bacteriophage or viral 
infection, electxoporation, heat shock, lipofcction, and partide bombardment The term '^ansformed 

25 cells** includes stably transformed cells in wbidi die inserted DNA is capable of replication dther as 
an autonomously replicating plasmid or as part of the host diromosome, as well as transiently 
transformed ceDs which eiqxress the inserted DNA or RNA for limited periods of time. 

A "transgenic organism,** as used herein, is any organism, including but not limited to animals 
and plants, in wbidi one or more of die celb of the organism contains heterologous nucleic acid 

30 introduced by way of human intervention, such as by transgenic techniques well known in the art The 
nucldc add is introduced into the ceQ, direct^ or indirectly by introduction into a precursor of the cell, 
by way of delibe^te genetic inanipnlarion, sudi as by ndooii^ection or by infecdcm witii a 
recornbinaatvirus. Ibetmngieaietic manipulation does ]K>tiix^^ 
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fcrtiSzation, but rather is directed to flbie iittroduction of a recombinant DNA molecule. The transgenic 
organisms contemplated in accordance vnUi the present invention include bacteria, cyanobactena, 
fungi, plants and animals. The isolated DNA of th^e present invention can be introduced into fliehos 
by methods known in the art, for example infection, transaction, transfoimation or transconjugation. 

5 Tedmiques for transferring the DNA of the present inmition BOito such organisms are widely known 
and provided in references sudi as Sambrook et aL (1989)» supnu 

A 'Varianf * of a particular micHc add sequence is defined as a nucMc add seqa^ice having 
at least 40% sequence id^id^ to the particular nucleic add sequence over a certain letiglli of one of 
flie nnckic add sequences using blastn with the TBLAST 2 Sequences'' tool Verdon 2.0.9 (May-07- 

10 1999) s^ at defonlt parameters. Sudi a pair ofnncldc acids niay show, for example, at feast 50%, at 
least 60%, at least 70%, at least 80%. ^ least 85%, at least 90%. at least 91%, at feast 92%, at feast 
93%, at least 94%, at least 95%, at feast 96%, at least 97%, at least 98%, or at feast 99% or greater 
sequence identic over a certain defined fength. A variant maybe described as, for exan^fe, an 
"altelic" (as defined above), "splice," "spedes," or *>>lymorphic'' variant A splice variant may have 
. 15 significant identity to a refaCTcemolecufe^lnit win fi^iMdtylmTO 

polymicfeotidf^ ^ nTtiin^fttft jgpHfttng of eofons daring mRNA procesdng. The corresponding 
po^^pdde may possess additional fianctional domains or lack domains tfiat are present in the 
re&sence molecule. Spedes variants are polynucleotide sequences that vary firom one spedes to 
anofter. Iheresuldi^g polypeptides wiflgpnetanylmvedgnificant 

20 odier. A polymotphic variant is a variation in the po^ucfeotidesequtaice of a particular gene 

between iiMitvidnals of a given spedes. Polymorphic variants also may encompass '^smgle nncleotide 
pol^imoipiusms'' (SNPs) in wMdi flie po^nucfeotide sequ^ice varies by one micleotide base. The 
pres^ice of SNP& may be indicative of, for example, a certain populatbn, a disease state, or a 
propensity for a disease state. 

25 A **varianf' of a particular polypeptide sequence is defined as a pol^rpeptide sequeaice having 

at least 40% sequence identity to the particular polypeptide sequence over a certain lengdi of one of 
the polypeptide sequences using blasip with the 'TBLAST 2 Sequences" tool Version 2.0.9 (MayW- 
1999) set at default parameters. Such a pair of polypeptides may show, for exan^fe, at least 50%. at 
feast 60%, at least 70%, at least 80%, at feast 90%, at feast 91%, at least 92%, at least 93%, at least 

30 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence 
identity over a certain defined fength of one of the polypeptides. 

THEINVENTION 
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The invention is based on tbe discovery of new human drug metabolizing «izymes (DME), the 
polynucleotides encoding DME, and the use of these compositions for the diagnosis, tieatmraat, or 
previEsntion of autoimmmie/inflammatory, cell proliferative, developm^tal, endocrine, eye, metabolic, 
and gastrointestinal disorders, including liv^ disorders. 

5 Table 1 summarizes the nomenclature for die full length polynucleotide and polypeptide 

sequences of the invention. Eacb polynucleotide and its corresponding polypeptide are correlated to a 
single Incyte project identification nnniber (Incyte Project ID). Each polypeptide sequence is d^ioted 
by boib a polypeptide sequence id^nti&ation numb^ (Polypeptide SEQ JD NO:) and an Incyte 
polypeptide sequence numb^ (Incyte Polypeptide ID) as ahown. Badi polynucleotide sequcaice is 

10 denoted by bofli a polynucleotide scqa^ice identification munber (Polynucleotide S£Q ID NO:) and an 
JsMcyt^ polynucleotide consensus sequ^ice numbv (bicyte Pdl^nncleotide ID) as shown. 

Table 2 shows sequences wilhbonQology to ^ polypeptides of Hie invention as identified by 
BLAST analysis against Hie GenBonk protean (geaqiept) database. Columns 1 and 2 show the 
polypeptide sequ^co id^itification number (Po^peptide SBQ ID NO:) and tiae corresponding Incyte 

IS polypeptide seq;ue]K:e nnniber ([ncytePo^peptiden!>)fo^ ColunmS 
shows Ibe GenBank idootification nnmb^ (Cienbank ID NO:) of the nearest C^nB ank bntnolog. 
Column 4 shows the piobabiUty score for tiie matdi between each polyp^tide and its GenBank 
bomolog. Column 5 shows tiie annotation of tfae GenBank bomolog along witii relevant citations 
wb^ s^licable^ all of wbicb are e^qxcessly incorpomtsd by reference herein. 

.20 Table 3 ^ws various structural features of ibepolyp^tides of the invention. Columns 1 and 

2 show fhe polypeptide sequence identification number (SBQ ID NO:) and the corresponding Incyte 
polypeptide sequence number (Incyte Polype^de ID) for each polypeptide of tiie inv^itioiL Column 

3 slK>wsti)enumb^ of amino add residues in eacb polypeptide. Column 4 ^ows potential 
phospborylation sites, and column 5 shows potential glycx)sylation sites, as determined by die MOTIFS 

25 program of tiie GCG sequence analysis software package (Genetics Computer Group, Madison WI). 
Column 6 shows amino acid residues comprising signature sequences, domains, and motiis. Column 7 
shows analytical methods for protein structurcAfunction analysis and in some cases, searchable 
databases to which the analytical mediods were applied. 

Together, Tables 2 and 3 summarize flie properties of potypeptides of the invention, and these 

30 prop^es establish that the claimed polype^des are drug metabolizing enzy^ For example, SEQ 
ID N0:3 is 40% identical to a mouse cytochrome F450 monooxygenase (GenBank ID g2653663) as 
detcrinined by flie Basic Local Mgnment Search Tool (BLAST, see Table 2). The BLAST 
pfobabihty score is 5.3e-91, which indicates tiie probability of obtaining the observed polypeptide 
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sequence aligiiment by diance. SEQ ID N0:3 also contains cytoGhrome P450 signature sequences as 
determined by searching for statistically significant' matches in hidden Markov model (HMM> 
based PFAM database of conserved protein family domains (see Table 3). Data jfrom BLIMPS and 
PROFILESCAN analyses pro^de fardier corroborative evidence that SEQ ID N0:3 is a member of 

5 die cytochrome P450 family. 

In an alternative example, SEQ ID NO:l is 58% identical to a lysyl oxidase from the yellow 
pCTdi (Perca flavescens ; GenBank ID §4929199) as determined by BLAST analysis Tbe BLAST 
probabiHty score is 1.9e-248. SEQ ID N0:1 also contains cytochrome P450 agnature sequences as 
determined by searching for statisticafly significant matches m the HMM-based PFAM database of 

10 conserved pmtein family domains and by BIIMFS analyses. 

In an altemative example, SEQ ID is 61% idcmtical to hnman flavin-contaming 
monoojcygpnase 5 (GenBank ID g559()46) as determined by BLAST analysis, wifli a probability score 
of4Je-18L SEQ ID NOl2 also contains flavin-containing monoo3^gpn^ 
determined by seantog for statisticaHy significant niatdhes mihe 

1 s conserved protein family domams and by BLIMPS and PROHLBSGAN analyses. 

M an altemative example, SBQID NO:4 is 39% identical to a Pteeadomonas 23-botanediol 
ddbydrpgooase (GenBank ID gS29564) as detenmned by BLAST analysis, with a probability score of 
2.0e-6L SEQ ID NO:4 also cantams dehydrog^[iase signature sequences as determined by searching 
for statisticalty significant matches in the HMM-based PFAM database of conserved protein family 

20 domains. Data fiomBLIMPS, MOTIFS, and PROFILBSCAN analyses provide further coiroboralive 
evidence that SEQ ID N0:4 is a ddrydiDgenase. 

SEQID N05 is 54% identical to a BadDas aoinone oxidase (GenBank ID §^2633069) as 
detenmned by BLAST analysis, with a probabifity score of 7.1e-96. Data obtained by searching the 
HMM-based PFAM database of conserved protein famQy domains and by BLIMPS analyses provide 

25 further oorroborativD evidence tibat SEQ ID NO:5 is a quinone oxidase. 

Jn an altocnativB exanqpfe, SEQ ID NO:6 is 92% identical to mouse heparan sulfate 6- 
snlfotxansfbrase 2 (G^ank ID g6683558) as determined by BLAST analysis, wifli a probgibility 
score of 2.3e-255. 

In an altemative example, SEQ ID NO:7 is 90% identical to a human ghitathione S- 
30 transferase swbnnit (GenBank ID g242749) as determined by BLAST analysis, with a probability 
score of 1.3e-101. SEQ ID N0:7 also contains gbtalhione S-transferase signature sequences as 
d^^mined by seardiing for statistically significant matches in the HMM-based PFAM database of 
conserved protein family domains and by BLIMPS analyses. 
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In an alternative example, SEQ ID NO:8 is 40% identical to a human steriod dehydrogenase 
(GenBanklD g5531815) as determined hy BLAST analysis, with a probability score of 1.9e-56. SEQ 
ID N0:8 also contains dehydrogenase signature sequences as determined by searching for statistically 
significant matches in the HMM-based PEAM database of conserved protem family domains. 

5 SEQ ID NO:9 is 47% identical to a rabbit liver caiboxylesterase (GenBank ID g3219695) as 

determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST 
probability score is 63e-72, which mdicates the probability of obtaming the observed po^^tide 
sequaace aUgnmenl by chance. SEQ ID N0:9 also contains catbosgtestraase donudns as determined 
by searching for statistically significant matches in the Mddeix Maitov model CHMM3-based EFAM 

10 database of conserved protein family domains. (See Table 3.) Data framBUMPS, MOTIFS, and 
PROEILESCAN analyses provide furthi^ corroborative evidence that SEQ ID NO:9 is a 
carboT^lesfceiase. 

SEQ ID NO:10 is 45% identical to hnman carboxylBsterase (GeaiBaiik ID gl8Q950) as 
dctismiincdbythoBaacLocalABgnD^ TheBLAST 
15 piobahgity score is 8.7e-130, which indicates tihe probabilily of obtammg flie observed polypeptide 
seqaeoceaBgnment by chance. SEQ ID NO:10 also contains carbox3^esterasBdoniams as 
detemmed by searching for stalisticalty significant mab^ 

based PFAM database of conserved protekfmuty domains Data firom BLIMPS, 

MQIIFS, and PROHLESCAN analyses provide fiirthea: corroborative evidcaice that SEQ ID NO:10 
20 is a carbox^BSt^tise. 

In an altmiative exanaple, SEQ ID Nail is 89% identical to mnrinehqparan sulfete 6- 
saHotransfease2 (GeaiBaiiro g6683558) as detmninedbyflie 

(BLAST, see Tabic 2). The BLAST probabiEty score is 1.8c-236, which indicates ib& probability of 
obtaining ti» observed polypeptide sequence afignment by chance, and provides evidence that SEQ ID 

25 N0:11 is a DME, and spedficaffly fliat SEQ ID NO:ll is a sulfotransferase. 

In an altematrve example, SEQ ID NO:I2 is 25% id«[itical to a Bacillus sabtilis epoxide 
hydtolase (GenBank ID g2633182) as deteraiined by the Basic Local Alignment Search Tool 
(BLAST, see Table 2). The BLAST probability score is LSe-ll, whidi mdicates the probability of 
obtaining tte observed polypeptide sequence aKgnment by diance. SEQ ID N0:12 also contams 

30 hydrolase domains as det^mined by seardung for statistically significant matches in the hidden 
Mmkov model (HMM>based PFAM database of conserved protem famify domains (see Table 3). 
Data from BLIMPS masses provide furflier corroborative evidence fliat SEQ ID N0:12 is a 
hydtolase. 
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la an alfecafiiativo example, SEQ ID NO:13 is 83% identical to a rat beta-alaninc-pyruvate 
aminotransferase (GenBank ID gl944136) as determined by the BLAST analysis (see Table 2). The 
BLAST probability score is l.le-234. SEQ ID N0:13 also contains aminotransferase domaios as 
determined by seardbing for statisticalb^ significant matches in liie hidd^ Markov model ^MM)- 
5 based PFAM database of conserved protein family doniaim (see Table 3). Data fix>m BLIMPS and 
PROHLBSCAN analyses provide furtiber corroborative evidence fliat SEQ ID N0:13 is an 
aminotransferasew 

In an altsnaative example, SEQ ID NO:14 is 50% identical to a guinea pig hyroxysteroid 
saUbtransf^rase (GenBank ID gll51081) as deteatninBd by tiie BLAST analyds (see Table 2). Tbe 
10 BLAST ptobabiHty score is S.4e-34» and provides ei^id^ice fbat SEQ ID NO:14 is a snlfotransferase. 

In an atteamadve example, SEQ ID NO:15 is 52% id^caL to a goineapig copperMic 
sapmxide dismntase (GeSkBank ID gl066120) as detmnined by tbe BLAST analyds (see Table 2). 
The BLAST piobabifity score is 2.16-25. SEQ ID N0:15 also contains copp^c/^inc sopoozide 
dismutase domains as detcmtined by seardbing for statistically significant matches in die bidden 
IS. Markov ]nod6l(EIMM)-basedFFAM database of consefved protein &n^^ 

Data £rom BUMPS analyses provide fiudiBr corroborative evidence fliat SEQ ID NO:15 is a 
coppest/TXDo superoxide dismutase. 

SEQ ID NO:16 is 37% identical to buman 3 -pihosplK>adenyl^lsu]fat&-galactosylceTamide 
d'-'Sidfotransferase (cerebioside solfotransferase, GenBank ID glS7I141) as determined by tbe Basic 
20 Local Alignment Search Tool (BLAST, see Table 2). Ibe BLAST probabifi^ score is 2.8e^0> \7bicb 
indicates tbe probability of obtaining die observed po^peptide sequence alignment by cbance. 

In an altetiattve exan:^>b, SEQ ID N0.17 is 38% ide^itical to a putative C elcgans 
monoamine oxidase (Ge^ank ID g6782275) as determined by BLAST analysis witb a probability 
score of 3.0&-99. SEQ ID NO: 17 also contains a monoamine oxidase domain as determined by 
25 searching for statistically significant matdbes in tiie bidden Markov model (HMM>based PFAM 
database of conserved prot^ family domains (see Table 3). Data from BLIMPS analysis provide 
fmttker corroborative evidence that SEQ ID N0;17 is a monoanune oxidase. 

In an alteanative example, SEQ ID NOrlS is 36% identical to buman catechol-O- 
melbyltransferase ((JenBank ID gl79955) as determined by BLAST analysis with a probability score 
30 of 9.5e-41. SEQ ID N0:18 is also 36% id^tical to murine catecbol-O-mctbyltransferase (GenBank 
ID g3493253) as detennbed by BLAST analysis wilb a probability score of 1.3e-41. 

In an ahBsmiivG example, SEQ ID NO:19 is 44% identical to Rmdnlas heterpclitos 
cytodin>ineP4502Nl (G^iBatikn) §5852342) as dete3!mined by die Baric Ixx:ali^^ 
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Tool (PLAST, see Table 2). The BLAST pTobabffity score is 4.6e-99, which indicates the probability 
of obtaining the observed polypeptide sequence aligmnent by chance. SEQ ID N0:19 also contains 
cytochrome P450 domains as determined by searching for statistically significant matches in the 
hidden Markov model (HMM>based PFAM database of conserved protein femily domams (see Table 
5 3). Data firom BUMPS, MOTIFS, and I^OFII£SCANanal^ 
evidence that SEQ ID NO:19 is a cytochrome P450. 

The algprithms and parameters for ^e ana^is of SEQ ID N0:l-19 are described in Table 

7. 

As shown in Table 4» the fiill ka:^ polynucfeotide seqaences of the present invention were 
10 assembled using cDNA sequoices or coding (exon) sequences deiived ftom graiomic DNA, or any 
combination of these two t^qpes of sequences. Cohmmis 1 and 2 list die poljpraicleotide sequence 
ideolifbation number OPolytmcleotide SEQ ID NO:) and the cortespondmg Incyte polynnc1eotid& 
consensus sequence numbor (hicyte Polynucleotide ID) for cadi polynacleotide of Ihc invention. 
CohinmB showsthelengdiof eachpolynncleotideseqara^ Column 4 lists fragments of 

IS the pofynndeotide sequ^ices which are useful, for example, in hybridization or anq>lification 
technologies that identify SEQ ID N(>20-38 or that distinguish betwecai SEQ ID NO:20-38 and 
related po^nuckotide seqaooces. Column 5 shows identification nonibers conespondhig to cDNA 
sequences, codii^ sequences (exons) predicted from gawnodc DNA, and/or sequence assemblages 
coo^itisedofbotfacDNA and genomic DNA. Tbeseseqaences wrae used to assemble the foil laigfli 
20 polynucleotide sequences of the invention. Columns 6 and 7 of Table 4 show ttie nucleotide start (5') 
and. stop (3') positions of die cDNA and/or gjenotnic sequences in cohmm 5 relative to fiieir respective 
foU length sequences. 

The identification nnixibexs in Cblunm 5 of Table 4 may lef^ specifically, for example, to 
Jncyt& cDNAs along with flieir cotrespondhig cDNA libraries. For example, 7690384X1 is the 

25 idwitification nmnber of an Incyte cDNA sequence, and PROSTME06 is the cDNA library from 
wbich it is derived. IiK^yte cDNAs for whidbi cDNA libraries are not indicated were derived fro 
pooled cDNA hbraries (e.g., 55017748J1). Alternatively, die idwitification numbers m column 5 may 
refer to GenBahk cDNAs or ESTs (e.g., gl203094) which contributed to the assembly of the foil 
lengfli polynucleotide sequaices. In addition, the identification numbers in column 5 may identify 

30 sequences derived fix)m die ENSEMBL (The Sanger Centre, Cllambridge, UK) database (ie., those 
sequences including die designation "ENST**). Alternative^, the identification numbers m column 5 
may be derived fit>m the NCBI RefSeq Nucleotide Sequence Records Database (ie. , those 
sequences indndnig the designation "NM" or 'WT) or the NCBI RefSeq Protein Sequence Records 
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{Le, » fiiose sequences includnig die designation ''NFO- Alternatively, die identiBcation munbocs in 
cofamm 5 may tefer to assemblages of both cDNA and Genscan-predicted exons brou^t togeOier by 
an "exon stitching" aJgoriJian. For example, ¥L_XXXXXX Jf^J^^YYYYYJf^Na lepieseails a 
"stitched" sequence in whidi XXXXXX is tiie id^itification number of the cluster of seqaences to 

5 wMch flie algoritbm was applied, and YYYYY is the number of tbe prediction generated by the 

algorithm, and Ni^_, if present, represent specific exons that may have been manually edited during 
analysis (See ^lampb V). AltMnativety, the identification numbers in column 5 may refer to 
assemblies of exons brought togelh^ by an "exon-stretdhdngf* algorithm. For example, 
VUQCKKXX^^zA^iAAA^BBBBJ. J^is^ identification number of a "stretched'' sequ^ce, with 

10 XXXXXXhesBg the Incyte project ideaitification number, gAAMA bemg the GenBank identification 
masAy&t of die human genomic sequence to which the "exon-sttetdiing"' algorithm was applied, 
gPBBBB being the G^ank idenlificatbn mxtnb^ or NCBI RefiSeq id^ifification mmiber of die 
neatest G^iDaiik protein homology and Nreftxnng to Ininstanccs 
where a ReiSeq sequenoe was used as a protein homolog for the "exon-stretduog" algpridmiy a 

IS RefSeq identifier (d^oted by •TM," ^OT,** or ''NTO may be used m place of die GenBank identifi^ 
(Le,,gPBBBB}, 

Ahematively, a prdBx identifies componi^ sequences that were hand-edited, predicted fix>m 
gjenoinicDNAsequ^ices^ordeiivedfiGomacoiabina^ The 
foHowing Table hsts ezan^les of campoD&ot sequence prefixes and corresponding sequence anal3sis 
20 methods associated wifti the prefixes (see Example IV and Example V). 



Prefix 


Type of ansdysis and/or examples of programs 


GNN,GFG, 
ENST 


Exon prediction from genomic sequences using^ for example^ 
GBNSCAN (Stanford University, CA, USA) or F6ENBS 

(Cominiter Genomics Group, The Sanger Centre, Cambridge, UK) 


6BI 


Hand-edited analysis of genonoic sequ^ices. 


FL 


Stitched or sttetched gjcnonnc seqaences (see Exanople V). 


INCY 


RiB lengdi transcript and exon prediction fiioin mapping of EST 
seqaences to the genome. Genomic location and BST composition 
data are combined to predict the exons and resulting transctipt 



in some cases, hicyte cDNA coverage redundant widi the sequence co vi^ragf^ shown in 
column 5 was obtained to confirm the final consensus polynncleotide sequence, but the relevant Incyte 
30 cDNA identification nuinibm ace not shown. 
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Table 5 ^ws the iqxtesentathre cDNA libraries for those Ml length polynucleotide 
sequences which were assembled using Incyte cDNA sequences. The representative cDNA library 
is the Incyte cDNA library which is most ftequently represented by the Incyte cDNA sequences 
whidh were used to assemble and confirm the above po!tynucbotide sequences. The tissues and 

5 vectors which were used to construct the cDNA libraries shown in Table 5 are described in Table 6. 

The invention also encompasses DME variants. A preferred DME variant is one which has 
at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the DMB amino add scqu^ce, and which contains at least one functional or 
structural characteristic of DMB. 

10 Ihe invention also encompasses polynucleotides whidh encode DMB. Inapartknlar 

CTibodiment, tiie invention eiux>mpasses a polynucleotide seqo^ce con^rising a sequence selected 
£romthegix>upcon^tmgof SEQIDN(>20'38,whidhc3icodcsDMB. The polynacleotide sequences 
of SEQ ID NC):20-3 8 » as piesented in the Sequence listing, embrace the equivalent RNA sequences, 
whetein occattmices of the nitrogetious base thymine are replaced with uracD, and sugar 

15 badd>one is conoposed of tibose instead of deoxyribose. 

The inve3DtdK>n also encompasses a van Ih 
particalar; such a variant po^ynucleottde sequence will have at least about 70%, or alternatively at least 
alHmt 85%, or even at least alK>ut 95% polynucleotide sequence id^ti * 
sequence encoding DME. Aparticularaspectof theinv^onenconoypassesavatiantof a 

20 polyoncleotide sequence conoi|»i^Dg a sequence selected from die group consisting of SEQ ID NO:20- 
3 8 whidbi has at least about 70% , or altmiatxvely at least about 85 %, or even at feast about 95 % 
polyDUcleotide sequence id^dity to a noclejc add sequence selected from l3bR group consistuig of SEQ 
ID NOt20-38. Any one of ^e po^nmcleotide variants described above can encode an amino acid 
sequence whidi contains at least one hmctional or stmctural characteristic of DME. 

25 It will be ^predated by those slaDed in die art that as a result of the degeneracy of the 

gj^ietic code, a multitude of polynucleotide sequences encoding DME, some bearing minimal similarity 
to die polymicleotidesequmies of any known and naturally occurring gen^ Thus, 
the invention contemplates each and every possible variation of polynucleotide sequence that could be 
made by sefecting combinations based on possible codon choices. These combinations are made in 

30 accordance widi the standard triplet generic code as applied to die polynucleotide sequence of 

naturally occurring DME, and all such variations are to be considered as being specifically disclosed. 

Aldiou^ nucleotide sequences which ^icode DME and its variants are generally capable of 
hybridizing to die nucleotide seqTi^u:e of die naturally occurring DMB under appropriately selected 
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conditions of stringency, it may be advantageous to prodacc nucleotide sequences encoding DME or 
its d^ivatives possessing a substantiafly different codon usage, e.g., inclusion of non-naturaHy 
occniring codons, Codons may be selected to increase the rate at whidb e:q>ression of the peptide 
occurs in a particular prokaryotic or eukaryotic host in accordance with fireqnency wifli wluidi 

5 particular codons are utilized by the host Other reasons for substantially alteririgflie nucleotide 
sequence encoding DME and its derivatives without altering the encoded amino acid sequences 
include tiie productian of SNA transcripts having more desirable properties, such as a greats hatf-life, 
tim tcansoipts produced ftom &e naturaUy occurring seq^ 

The invention also ^icompasses production of DNA sequences which encode DME and 

10 DME derivatives, or fragments thraeof, entirely by synthetic c^^ After production, the 

synthetic sequence may be insetted into any of ^ many available expression vectors and ceQ systems 
using reagents wen Imown in file art Moteov^, synthetic cSiemistrymaybeused to introduce 
nnutations into a sequence encoding DME or any fragment tiu^reof. 

Also mcon^passed by ihe invention are po^ucfeotiide sequences that are capable of 

1 S hybridizing to the claimed polynudeotide sequences, and, in particular, to those shown in SEQ ID 
NO:20-38 and firagmn^ Ihraeof und^ various conations of stringency. (See, e.g., Wahl, G.M. and 
SX. Beffgpr (1987) Mefliods Bnzymol 152:399-407; Kimmd, A JEL (1987) Meftiods En2ynioi 152:507- 
51 1.) Hybridization conditiois, including annealing and wash conditions, are described in "Definitions." 
Methods for DNA sequencing are well known in the art and may be used to practice any of 

20 tiieembodim^its of the invention. Ihe methods may ei]qiloysucii»i27ymes as the 

of DNA polymerase I, SEQUENASE (US Biodinnical, Qoveland OH), Taq polymerase (Applied 
Biosystems), tbetmostable 17 polymcme (Ammham Fhamiada Biotech, Fiscataway NJ), or 
combinations of polymerases and pioofiieading exonucleases such as tiiose found in tibe HLONGASE 
ampHfication system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is 

25 automated with niachinessudi as the MICROIJ^ 2200 Eqm 

FrC200 tiiermal cycl^ (MJ Researdi, Wateatown MA) and ABI CATALYST 800 thermal cyclear 
(Applied BiosystMns)» Sequencing is then carried out using eitii^ the ABI 373 or 377 DNA 
sequencing system (Applied BiosystBms)» tte MEGABACB 1000 DNA sequendng system 
(Molecular Dynamics, Sunnyvale CA), or other systenis known in the art The resulting sequences 

30 are analyzed using a variety of algorithms which are well known in the art (See, e.g., Ausubel, F.M. 
(1997) Short Protocols in Molecu lar Bmlopy. John ^ey & Sons, New York NY, unit 7.7; Meyeis, 
ILA. (1995) Molecular Biology and Biotechnology. WHey VCH, New York NY, pp. 856-853 ) 
Ilie nucleic add sequqoces ^icoding DME may be extended util^^ 
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seqoencc and employing various PCR-based methods taiown in the art to detect upstream sequences, 
such as promoters and regulatory elanents. For example, one method which may be employed, 
restrictLon>site PGR, uses imiversal and nested primers to ampUfy unknown sequence from genomic 
DNA within a cloning vector. (See,e.g.,Sarkar, G. (1993) PCRMethodsApplic. 2:318-322.) 

5 Another method, inverse PGR, uses primers that extend in divergeot directions to amplify unknown 
sequence JBrom a circularized tonplate. The template is derived from restriction fragnientscoinprisi^ 
a known genomic locos and sotrounding sequences. (See, e.g,, Triglia, T. et al (1988) Nucldc Adds 
Res. 16:8186.) A third niE^od, capture PGR, involves PGR anq>lification of DNA fragments adjaceait 
to known seqaences infanman and yeast artificial chromosome DNA. (See» e.g., Lag^trom, M. et 

10 al. (1991) PGR Methods AK>lic 1:111-119.) M this method, innltqjle restriction enz;ynDedigq^ 
ligations maybe used to insert an engjuaeered double^stranded sequ^ice into a region of unknown 
sequence before performing PGR Oflier mediods whidi may be used to retrieve unknown sequences 
ate knownin file art. (See, eg., Pa&er; J.D. et ai. (1991) Nudcdc Adds Res. 19:3055-3060). 
Additionally, one may use PGR, nested primers, and FROMOTERHNDER libraries (Gbntech, Palo 

15 Alto GA) to walk g^mic DNA. This piXK:ediue avoids the need to scisen libraries an^ 
finding intron/exon junctions. For anPGR-^asedmeltiodSypcinietsinay be designed using 
(xnxmsxdaSiy availabfe software, such as OLKjO 4.06 prim^ analysis software Q^ational 
Bioscioices, Plymouth MN) or anofter appropriate program, to be about 22 to 30 nucleotides in l^oigth, 
to have a GC content of about 50% or riiore, and to anneal to the trai^late at ten^ratures o^ 

20 68^tD72^C 

When scte^ung for full l^ogOi cDNAs, it is preferable to use libraries that have been 
^ze-seleded to include laigardDNAs. In addition, random-primed libraries, wbich often include 
sequonces containing the 5* regions of graies, are preferable for situations in which an oligp d(T) 
library does not yield a fuB-lengtli cDNA Genomic libraries may be useful for extension of sequence 
25 into 5* non-transcribed regulatory regions. 

GapiHary ekctroi^oresis systenis wM<^ are conmierciallyavMlableniay be used to analyze 
die size or confirm the nucleotide sequence of sequencing or PGR products. Jn particular, capiflary 
sequ^cmg may ^ploy flowable poj^noers for electrophoretic separation, four diffa:^ nucfeotide- 
specific, lasCT-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
30 emitted wavelengflis. OutputAlight int^isity may be converted to electrical signal using appropriate 
software (e.g., C3EN0TYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer anatysis and electronic data display may be conDputer 
controlled. C!qxillary electrophoresis is espedaS^ preferable for sequencing small DNA fi-agments 
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which may be pms^ iu limited amounts in a particular sample. 

In anottier embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode DME may be cloned in recombinant DNA molecules that direct expression of DME, or 
fragments or functional equivalents thereof, in appropriate host ceDs. Due to the inherent degenca^cy 

5 of the genetic code, odrer DNA sequences whidi encode substantially the same or a functionally 
equivalent amino acid sequence may be produced and used to express DME. 

The nucleotide sequences of die present invention can be engmeered using methods generally 
known in the art in order to alter DME-encoding sequeaices for a variety of purposes including* but not 
limited to, modification of tiie cloning, processing, and/or egression of tiie gene product DNA 

10 glnifFKTig by random fragmentation and PCR reassemibly of g^e fragments and syndetic 

oHgonucleotidesmaybensedtoengmeerfhemcleotidesequCT^ For example, ofigonacteotid^- 
mediated site-directed mutagenesis may be used to introdnoe mutations tiiat create new resbiction 
sites, alt^ glycosylation patterns, diange codon prdeicnce, produce splice variants, and so fbiftu 

The nucleotides of die present jnv^on may be subjected to DNA shufOing tedudqnes such 

15 as MOLECULARBREEDING (Maxygpn Inc., Santa Clara CA; described in US. Patent Nunaber 
5,837,458; Chang, C.-C aL (1999) Nat BiofcedmoL 17:793-797; Christians, RC et aL (1999) Nat 
Biotedmol 17:259-264; and Qamm, A. ct aL (1996) Nat Biotechnol 14:315-319) to alter or improve 
tibe biological piop6rd6& of DME, such as its biological or enzymatic acti^ or its ability to bind to 
other molecules or compounds. DNA diufflmg is a process by wHch a library of gpnevari^ 

20 produced using PCR-inedlalediw>nibmation of gene fragments The libraiy is flien subjected to 
selection or scireening procedures fliatideiitifyftu>se gene varia These 
prefened variants may flicii be pooled and iunfaa: subjected to recursive rounds of DNA shuffling and 
selcction/screeaimg. Thus, geiKlic&veisity is created through 

evolution. For example, fragments of a smgle gene containing random pomt mutations may be 
25 recombined,scareatted, and fliMiiesliafHeduntfl the desired prop AHematively, 
fragments of a given geije may be lecombined with fragments of homologous genes in the same gene 
family, dtlbfir fiom flie same or different species, tiiereby m a ^mi yan g flie g^ietic diversity of multiple 
naturally occurring genes in a directed and controllable manner. 

In another embodiment, sequences encoding DME may be synthesized, in whole or in part, 
30 using chemical methods well knovm in tiie art (See, e.g., Camthers, MIL et al. (1980) Nucleic Adds 
Syn^. Ser. 7:215-223; and Bfom. T. et aL (1980) Nucleic Adds Symp. Ser. 7525-232.) Alternatively, 
DME itself or a fragment th««»fniay be synthesized usmgdiem^ For example, peptide 

synthesis can be performed using various solution-i^iase or solid-phase techniques. (See, e,g.. 
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CrdghtDn, T. (1984) Protdms, Stroctnies and Molecolar Properties, WH Freeman, New York NY, pp. 
55-60; and Roberge, J.Y. et aL (1995) Sdence 269:202-204.) Automated synfliesis maybe achieved 
nsiBg the ABI 431A peptide synthesizer (Applied Biosyst^). Additionally, the amino acid sequence 
of DMB, or any part fliereof, may be altered during direct synthesis and/or combined with sequences 

5 fiom other proteins, or any part thereof, to produce a variantpolyp^^ 
sequence of a naturally occurring polypeptide. 

The peptide may be substantially pwrified by preparative high perfomoance liquid 
duomatography. (See, e.g., Chiez, RM. and F.Z. Regnicr (1990) Metiiods EnzymoL 182:392-421.) 
Hie composition of the synthetic peptides maybe confirmed by amino add analysis or by sequencmg. 

10 (See, e.g.,Crei^ton,su]^ pp. 28-53.) 

hi ordra- to express a biologicalty active DME, the nucleotide sequ^sces eacodrng DME or 
derivatives thereof may be ins^tcd into an appropriate expression vector, Le., a vector whidi contuns 
ftie necessary elements for transcriptional and translational control of the insrated coding sequence in 
a suitable host These elemi^ iiK^de regulatory sequences, sudi as enhancers, constitutive and 

15 indttdble promote, and 5* and 3'unlranslated regions in the vector and m polynucleotide sequwices 
encodingDMB. Sudi ekatncsnls may vary in thdr strength arf Specific initiation dgnals 

inay dso be used to achieve more efBdeait translation of sequences encoding DM Sudisignals 
include the ATGimtiationcodon and adjacent sequences, e.g.tiiBK^ Meases where 

. sequMices ^coding DME and its mitiation codon and upstream regulatory sequ^ces are inserted mto 

20 flicappTopriatee:iqttesdonvector,noadditionalttaiisciiptional 

needed. Bbwevca-, in cases wheic only coding scqucaice, or a fia^^ 

exogenous translational control signak including an ii^firame ATG initiation codon should be provided 
bylhevector. Exog^ustransMonaldemients and initiation codons may be of va^ 
natural and syntiietic The effici^icgr of expression inay be ©dhanced by the incto^ 
25 appropriate forthepaiticnlarhostcellsystCTiuscd. (See, eg., Sdiarf, D. et al (1994) Results ProbL 

CellDiffbr. 20:125-162.) 

M^iods which are wellknown to tiiose skilled in the art may be used to constract expression 
vectors containmg sequences encodmgDME and aippropriate transcriptronal and translational control 
elements. These inelhods include mjitoreconibm^ 
30 mSi gpnetic rccornhmation. (See, e.g., Sambrook, J. ct aL (1989) Molecular Qomn^, A Laboratory 
MaimaL Cold Siring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, RM. et aL (1995) 
Cnrtemt PtPtocols in Molecular Biology . John Wiley & Sons, New Yoik NY, ch. 9, 13, and 16.) 
A variety of expression vector/host systans inay be utilized to contain 
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encoding DME. These include, but aie not limited to, noicioorganisiDS sacb as bacteria transformed 
with lecombiaant bactenophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovinis); 
plant ceD systems transfbimed witii viral e^qncession vectors (e^g., cauliflower mosaic virus, CaMV, or 

5 tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal cell sj^tems, (See, e.g., Sambrook, supra ; Ausubel, supra; Van Heeke, G. and S.M, Schuster 
(1989) J. BioL Chem. 264:5503-5509; Engelhard, E.K. et aL (1994) Proc. Nad Acad. Sci. USA 
91:3224-3227; Sandig, V. et aL (1996) Hum. Gene Ther. 7:1937-1945; Takamatsu, N. (1987) EMBO 
J. 6:307-311; The McGraw ESn Yearbook of Sdaice and Technology (1992) McGraw BOH, New 

10 York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Naa Acad. ScL USA 81:3655-3659; and 
Harrington, J.J. et al (1997) Nat Genet 15:345-355.) Expression vectors derived fix>m retroviruses, 
ad^u>virases, or herpes or vaccinia vimses, or from various bacterial plasmids, may be used for 
delivery of nucleotide sequ«ices to the targeted organ, tissue, or cell population. (See, e.g., Di Nicola, 
M. et aL (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al (1993) Proc. Natl Acad. SdL USA 

15 90(13):6340-6344; BuJler, R-M. et aL (1985) Nature 3 17(6040):8 13-815; McGregor, DJP. et al, (1994) 
MoL IramimoL 31(3):219-226; and Veima, LM. and N. Somia (1997) Nature 389539-242.) The 
iiLvention is not fimited by the host cell employed. 

In bacterial systems, a nmnb^ of cloning and esqpression vectors maybe selected dependiDg 
upon the use int^idedjforpolyniideotide sequences encoding DMR For example, routine cloning, 

20 snbcloning^ and propagation of po^wcleotlde sequences encodixig DMB can be adiieved using a 
multifunctional RooK vector such as PBLUESCRIPT (Stratagcne, La JoIla CA) or PSPORTl 
plasmid (Life Technologies). ligationof sequences encoding DMB into the vector's minltq^^ 
site disrupts the JocZ gpine, allowing a colotimetric sareening procedure for identification of 
tygms forme d bacteria containmg recoinbinant molecules. In addition, diese vectors may be usefaL for 

25 in vitro transciiption, dideoxy sequencing, singb strand rescae wifti helper pliagea and creadon of 
nested dektions in die cloned sequence. (See, e.g.. Van Ebeloe, G. and SM. Sdmster (1989) J. Biol 
Chem. 264:5503-5509.) WhenlaigeqaandtiesofDMBareneeded,e.g. forthepioductionof 
antibodies, vectors wMc^directlugblevclesqxression of DMB inay be us^ For exaniple, vectors 
containing die strong, inducible SP6 or T7 bacteriophage promoter may be used. 

30 Yeast expression systems may bc used for production of DME. A number of vectors 

containing oonstittttive or inducible promoters, sach as a^tha fiactor, alcohol oxidase, and PGH 
promotBTs^ maybe used in the veast Saccbaromvces c^visiae or Kchia pastoris. In addition, such 
vectors direct either the secretion or intracelhilar retention of expressed proteins and liable integration 
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of foreign seqaenccs into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra ; 
Bitter, G. A. et aL (1987) Methods EnzymoL 153:516-544; and Scoier, C.A. et aL (1994) 
Bio/Tedmology 12:181-184.) 

Plant systems may also be used for expression of DME. Transcription of sequences encoding 

5 DME maybe driven by viral promoters, e.g., Ihe 35S and 19S promoters of CeMV used alone or in 
combination with the omega leader sequeiK^e from TMV (Takamatsn, N. (1987) EMBO J. 
6:307-311). Atoeriiatively, plant promoters such as the sniallsubunit of RUBISCO or h^ 
promoters may be used. (See, e.g., Coruzzi, G. etal (1984)EMBOL 3:1671-1680; BrogIie,R.etaL 
(1984) Science 224:838-843; and Wmter, J. et aL (1991) Results Probl. Clell Differ. 17:85-105.) These 

10 constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated 

transfection. (See, e.g.. The McGraw KQll Yearbook of Science and Tcdmology (1992) McQraw IBll, 
New Yoj*: NY, pp. 191-196.) 

T paTntnaTi an cells, a number of viral-based eiqgession systems may be utilized. Incases 
- whcio an adenovinis is used as an expres^on vector, sequ^ices ^ocoding DME may be Egated into 

15 an adenovirus transcription/transbtion compfex cfmsisting of the late promote and tripartite leador 
sequence^ Insertion in a non-essential £1 or E3 region of Ac viial g^ome may be used to obtain 
infective virus expstsscs^ DME inbost cells. (See, e.g., Logan, J. and T. Shenk (1984) Proa 
NatL Acad. ScL USA 81:3655-36590 In addition, transcdption enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase eayrcssion in mammalian host cells. SV40orEBY- 

20 based vectors may also be used for high-level protdn expression. 

Boman aitifidal chromosomes (HACs) may also be employed to deliver larger fragments of 
^DNAttmcanbe contained ni and expressed from a plasnnd. HACsof abont6M>to 10 Mb are 
constructed and delivmd via conventional delivery methods (liposomes, polycationic amino polym<^, 
or vesicles) for tiierapeotic purposes. (See. e.g., Harrington, J J. et al (1997) Nat Q&oieL 15345- 

25 355.) 

R>r long tcxm production of recoinbinant protons in mammalian systegoos, stable expression of 
DMB in cell fines is prc&rred. For exaiiq»lB^S6q[n^K;es encoding DME canbe transformed into cell 
lines nsintg expression vectors whidi may contain viral origins of replication and/or endogenous 
expression elements and a selectable mado^ giene on tiie same or on a sepamte vector. FoQowmg die 
30 introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before 
being switched to selective media. The purpose of the selectable rnaiker is to confer resistance to a 
selective ag^it, and its presence aDows growth and recovery of cells whidi successfully express the 
introdoced sequ^ices. Resistant dones of stabl^ transformed cells may be propagated using tissue 
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cahuie tedbmqpies appropnatB to die cell type. 

As^ onmbcx of selection systems may be used to recover transfbrmed cell lines. These 
inchide, but are not limited to, the herpes simplex virus fliymidine kinase and adenine 
phosphoribosyhransferase genes, for use in tkr and apr ceDs, respectively. (See, e.g.. Wigler. M. et 

5 al. (1977) Cell 11:223-232; Lowy, L et aL (1980) Cell 22:817-823.) Also, anlimetaboHte, antibiotic, or 
hcrbidde resistance can be used as fte basis for selection. Vox example, dhfr confers resistance to 
methotrexate; neo conf^ resistance to the aminoglycosides neomycin and G-418; and ais and pal 
confeiedstancetocMorsalfnronandphospliinDtiicinacetyltr^ (See,e.g., 
Wigler, M. et al. (1980) Proc. NatL Acad. ScL USA 77:3567-3570; Colber^Garapin, F. et aL (1981) 

10 J. MoL Biol. 150:1-14.) Additional selectable genes have been desCTbed. e.g., trpB and hisD, which 
alter cellular requirements for metabolites. (Sec, eg., Hartman. S.C and MolSgan (1988) Ploc. 
Nad. Acad. ScL USA 85:8047-8051.) Visflde maiiras, e.g., anflwcyanins, green fluorescent proteins 
(GFP; Qontedi), B ghicuronidase and its substaate B^coronide, or todferase and its substrate 
tadferinmaybensed. Ihese nwrlcetscm be iisc^ not only to identify ttansfon^ 

15 quantifyttKamanntoftransieotorstabbiTOtrincspicssionatlributab^ 

(See, e.g., Rhodes. CA. (1995) Mefliods MoL BioL 55:121-131.) 

ABhou^ die presoice/Asence of maker geaw egression sug^ts Aat ae gen^ 

is also iMBsent, the ptesemce and raqptression of flie gMK inay n^ 
Ac se<peiK»encodingDMB k inserted within amaiter gene seq^ 
20 scqnimcesenc»dmgDMEcanbeidentifiedbyfl»abs<aiceofmarbage^ 

matkex gene canbeplaced in tandem wifti a sequeooe racodiiig DME under the conttol of a siiJgle 
pnxmbtcr. BxpiesaiDnof tbemaABrgpmeinresponselDinductionorselectiDnusasdfy 

oqnesaon of die tandem giene as welL 

In genena. host cdb fliat contain to nnclac add seq^^ 

25 DMBrnaybeideaitifibdbyavarietyofpiocednresbiDwntolliDseofsmintheart These 
procedures indnde,bm ate not Smiled to, DNA-DNA or DNA-RNAhybridizations, 
amplification, and protein bioassay or immnnoassay techniques which include membrane, solution, or 
dbtq> based technologies for fl» detection and/or quantification of nncfcic add or protein sequences, 
hmmmologicalmethods for detectirig and measuring fte expresaon of DME u^ 

30 spedfic polyctondormonoctonid antibodies are Impwnin the a^ Examples of sudi tedmiques 
indude eazyme-Hnked immunosorbent assays (EUSAs), radioimmunoassays (MAs), and 
flnoiescence activated cell sorting (FACS). A two-site, monoclonal-based immonoassay utilizing 
monoclonal antibodies leactivB to two noit-interfaing t^ritopes on DME is preferred, but a competitive 
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binding assay may be employed. These and other assays are well known in the art (See, e.g., 
Hanqjton, R et aL (1990) Seiplogical Methods, a Laboratory Manual APS Press, St PaulMN, Sect 
IV; Coligan, J JB. et al (1997) Cnrrent Protocols in Lnmunology, Greene Pub. Associates and Wiley- 
Interscience, New York NY; and Pound, J,D. (1998) TrntminQch emical Protocols. Humana Press, 
5 TotowaNJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
maybe used in various nucleic add and amino add assays. Means for producing labeled hybridization 
or PGR probes for detecting sequences related to polynucleotidBS encoding DM£ indade oligolabeling, 
nick translation, end-labeling, or PGR amplification using a tabekd nackotide. Ahemalively, the 

10 sequemxes eaicoding DME, or any fcagmcsnts fliereof, maybe cloned into a vector for flie production of 
anmRNAprobe. Sudi vectors arc known in tibe art, am conmieidally availably 
syolbesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7 , T3 , or SP6 
and labeled nncleolides. Theseproceduresnmybeconductedusinga variety of commeida^ 
available kits, such as those provided by Ameisham Fbatmada Biotech, Ptomega (Madison WI), and 

15 US BiodiemicaL Smtable reporter nmolecales or labebwMdbiinay 

indnde radiomiclides, enzymes, fluoresce dbemibminescent, or dbromog^nic ag^, as well as 
substrates, cofactors, inhibitors, magnetic particles, and the Ske. 

Host celk transformed with nackotide sequences encodmg DME UMiy be 
conditions suitable for the ejsprcsaon and recovoy of flie protein firo Tbeprotdn 

20 produced by a transformed ceUinay be secreted or retafajed intracelWarlydepaDding on the sequence 
and/or the vectorused. As will be nnderstxK>d by diose of skiQ in tibe art, egression vec^ 
polynucleotides wMcb eaicode DMB may be designed to oontmn signal sequences which direct 
secretion of DME tfmm^ a ptolcaiyotic or eukarjrotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate espiession of die 

25 inserted sequences or to process the eiqwessed protein m fee desi^ Such modifications of 

the polypeptide include, but are not limited to, acetylation, carboxylation, glycos^ation, phosphorylation, 
lipidation, and acylation. Post-translational processing which cleaves a >rcpro'' or '^pro'" form of the 
protein inaydso be used to sped^ protein targeting, folding, and/or activity. Differ^ host ceDs 
whidihave specific cellular machinery and diaracteristic mechanisms for post-translational activities 

30 (e.g., CHO, HeLa, MDCK, BDBE293, and are available firom the American Tjpe Culture 
Collection (ATCC, Manassas VA) and may be choseai to ensure the correct modification and 
processing of the foreign protdm. 

In anoth^ embodiment of the invention, natural, modified, or recombinant mideic acid 
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sequences encoding DMB may be Bgated to a heterologous sequeiice resulting in translation of a 
fusionproteininanyoffheaforesn^onedliostsystenis. For example, a dbiniericDMB protein 
containing a heterologous moiety fliat can be recognized by a commercially available antibody may 
facilitate the scre^iing of peptide libraries for inhibitors of DME activity. Heterologous protein and 
5 peptide nroieties may also facilitate puriScation of fusion proteins using conunerdaDy avaUable afSniiy 
matrices. Such moieties include, but are not limited to, ghitathione S~transferase (GST), maltose 
binding protein (MBP), thioredoxin (Trx), calmodulin bmding peptide (CBP), 6-His, FLAG, c-wyc, and 
hemagglutinin (HA). GST, MBP, Trx» CBP, and 6-His enable purification of their cognate fusion 
proteins on immobilized gtutadiione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins, 
10 respectively. FLAG, c-wyc, and hemagglutinin (HA) enable immunoafBnity purification of fiision 
proteins using conun^aliy available monoclonal and polyclonal antibodies tbat specifically recognize 
fliese epitope tags. A fusion prote^ may also be enginemd to contain a proteolytic cl^ 
located between die DME enco£ng sequence and die hoterologoiis pro 

may be cleaved away fiom the heterologous moiely foBowing purification. Me&ods for fiidon protdn 

IS expiession and purification are discussed in Ausubel Q99S. supra , di. 10). A variety of oommercially 
. available tits rnay also be used to fiadfitato 63^^ 

In a fanh^ embodimeiit of the invitation, synthesis of radiolabded DMB maybe achieved in 
vitro using tbo TNT rabbit reticuJocvtc Ivsatc or wheat germ extract s vston rPromega\ These 
systiEm& couple transcnqption and translation of protean-coding sequences op^rably associated with the 

20 T7,T3, or SP6 promote. lYandation takes place in die presence of a radiolabeled.amhu^ 
precursor, for exmxiple^ ^S-rmeduonine. 

DMB of the ptes&A invention or fi:agaijmt5 dicieof may be used to screen for compounds diat 
specifically bind to DMR At least one and up to a plurality of test coinpounds may be screened for 
specific bmding to DME. Exanq>les of test compcmii5& include antibodies, oligonucleotides, {^oteins 

25 (e.g., receptors), or small nK>lecules. 

In one CTibodiment, the compound thus identified is closely related to die natural ligand of 
DMB, e.g., a ligand or fi^agm^t diereof, a natural substrate, a structural or fimcdonal mimetic, or a 
natural binding partner. (See, e.g., CoKgan, J.E. et aL (1991) Corrent Protocols in ImmunoIogY 1(2): 
Chapter 5.) Similarly, the compound can be closely related to die natural receptor to which DME 

30 binds, or to at least a imgment of tiie receptor. e.g., the ligand binding site. In either case, the 
coiiiqpound can be ratioiiaDy designed using Imowntedbniques. In oiie ^bodiment, scre^iing for 
diese compounds involves producing appropriate cesBs which caq^ress DMB, eidier as a secreted 
protein or on the ceQ membrane. Pfeferyftd r^Jht mnlnde c&TIs from TnatnTnak ^ yeast, DrosopMhu or E. 



61 



wo 02/12467 PCT/USOl/24382 



colL CeQs expressing DME or cell membrane fracdons which contain DME arc then contacted with a 
test compound and binding, stimulation, or inhibitiion of activity of either DME or the compound is 
analyzed. 

An assay may simply test binding of a test compoimd to the polypeptide, wherein binding is 
5 detected by a fhiorophore, radioisotope, enzyme conjugate, or other detectable label. For example, the 
assay may comprise the steps of combining at least one test compound with DME, either in solution or 
affixed to a solid support, and detecting the binding of DME to the compound. Alternatively, the assay 
may detect or measure binding of a test compound in the presence of a labeled competitor. 
Additional^, the assay may be carried out using cell-free preparations, chemical libraries^ or natural 
10 product mixtures, and ftie test compound(s) may be j&ee in solution or affixed to a solid support 

DME of the present invention or fragments thereof may be used to screen for compounds that 
modulate the activity of DME. Sudicon^unds may include agonists, antagonists, or partial or 
inverse agonists. In one embodiment, an assay is peiibimed under conditions permissive for DME 
activity, wherein DME is combined with at least one test compound, and the su^dvity of DME in 
15 presence of a test c»3mpound is co!nq>atedwidi the activity of DA^ 

compound. A dbangem the activity of DMB in &e presence of fiie test compou^ 
conqmund tibat modulates die activity of DMB, Alternatively, a test con^und is conibinjc^wiQi an in 
vitro or cell-free system conqixising DME und^ conditions suitable for DME activity, and die assay is 
performed. Ineidierof these assays, a test coxnpound which modulates die activity of Dl^ 
. 20 so indirectty and need not come in dusct contact widi the test con^o^ Atleastoneandupto a 
pfairaHty of test compounds may be screened. 

In another embcidincient, pol^imclBotides ODcod^ ^ 
Imocked ouf in an animal model system using homologous iBCombinadon in CTbryomc stem (BS) 
cells. Sudi techniques are weHlmown in the art and are useM for die gyration of anin 
25 human disease. (See, eg., U.S. Patent Nunoibcar 5,175383 and U.S. Patent NurribCT 5,767337.) For 
exan^k, moose BS cells , such as the mouse 129/SvJ cell line, are dedved fix>m die eady mouse 
^nbiyo and gtovm in cabuie. The ES cdls are transformed widi a 

interest disrapted by a marker gpne, e.g. , die neomydn phosphotransforase g^ (neo ; Capecdii, 
M.R. (1989) Science 244:1288-1292). The vector integrates into the corresponding region of the host 
30 g^ome by homologpus recombination. Alternatively, homologous recombination takes place using the 
Cre-loxP system to knockout a gcaie of interest in a tissue- or developmental stage-specific manner 
(Mardi» J J). (1996) Clin, hivest 97:1999-2002; Wagner, K.U. et al (1997) Nucldic Adds Res. 
2S>i323-4330). Transformed £S cells axe identified and rnioroiiyected into nwusecen^ 
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as those fix>m the C57BI76 mouse strain. The blastocysts are surgicaflytransfOTed to 
pseadopiegnant dains» and ihe lesultiiig dmnmc progCTy are genotyped and bied to produce 
heterozygous or homozygous strains. Transgeiiic aiumak thus generated may be tested with pot^tial 
therapeutic or toxic agents. 

5 Polynucleotides encoding DME may also be manipulated in vitro in ES cells derived from 

human blastocysts. Human ES ceDs have die potential to differentiate into at least ^ght separate cell 
lineages including endodemi, mesoderm, and ectodermal cell types. These cell lineage differentiate 
into, for example, neural ceBs, hematopoietic lineages, and caidiomyocytes (Thomson, J. A. et aL 
(1998) Science 282:1145-1147). 

10 Polynucleotides Micoding DME can also be used to create 'IcnocMn'' humanized animals 

(pigs) or transgenic animals (mice or rats) to model human disease. With knoc'kin technology, a region 
of a polynucleotide encoding DME is injected into animal ES cells, and the injected sequence 
integrates into the animal cell g^iome. Transformed cells are injected into blastulae, and the blastalae 
are implanted as described above. Transgoodc progeny or inbred hnes are stodied and treated widi 

15 potential pharmaceutical agj^its to obtain info^^ Altcxnatively, 
a mammal inbied to ovi^xpiess DME, e.g.,by seoreting DME in its milk, may also smtc as a 
convenient-source of Ihs^ protdn (7aime> J. et aL (1998) Biotechnol Amm. Rev. 4:55-74). 

THERAPEUTICS 

Cbenucal and stractoral similarity, e.g., in the context of s 

20 regions of DME and drag metabolizbg ^izymes. Inadditiorwtheej^firessionofDMEiscbsely 
associated vnOx a variety of diseased tissues, including ftat of the brain, prostate, bone, intestine, and 
breast Therefore, DMB ^vpeais to play a lole in aatoimmnne^nflamniatoiy, cell prolife-ative, 
develDprnentd, eDdocono, eye, metabolic, and gastrointestinal disorders, including Hrer disorders. In 
die treatment of disorders assodated wiSti increased DME egression or activity, it is desirable to 

25 decrease flieexpresdon or activity of DME. In the treatrnent of disorders assodated mth decreased 
DME eiqxression or activity, it is desirable to increase the cxptes&ion or activity of DME. 

Thearefore, in one embodiment, DMB or a firagment or derivative diereof maybe administered . 
to a subject to tteat or prevent a disorder assodated wifli decreased e?cpression or activity of DME. 
ExatE;»]es of such disord^ include, but are not fimited to, an autormmune^nflammatory disordra-, sudi 

30 as.acquiicd immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress 

syndrome, allergies, ankylosing spond^Stos, am^idods, anemia, asduna, atherosclerosis, autoimmune 
hemolytic anomia, autoimmune diyroiditis» autoimmune polyendocrinopadiy^candidiasis-ectDdermal 
dystioi^y (APECED), brondiitis, diolecystitis, contact d^nnatitis, Crohn's disease, atopic dermatitis, 
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dennatomyositis, diabetes melEtas, CTiphysema, episodic tympihopMiia with lymi^iocytotoxins, 
eiydiioblastosis fetafis, eiythema nodosum, atropihio gastdtis, glomerolonephritis, Goodpastoic's 
syndrome, gout. Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, 
multiple sclerosis, myasthenia gravis, myocardial or pericardial inflanamation, osteoarthritis, 
5 osteoporosis, pancreatitis, potymyositis, psoriasis, Reiter*s syndrome, jteumatoid arthritis, sclerodemaw, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, 
thrombocytop^c purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, viral, bactmal, fimgal, parasitic, protozoal, and helminliiic 
infections, and trauma; a cell proBfcrativB disoid^, sudh as actinic k^atosis, att^osderosis, 
10 atherosclerosis, bursitis, dnhosis, hepatitis, mixod connective tissue disease (MCTD), myelofibrosis, 
p^xysmsd nocturnal hemoglobinDria, polyc^tibaDQia vera, psoriasis, primary Ihrombocydienua, and 
cancefs jjnnlnfKi^g adenocardboma, leakmiia, lymphoma, melanoma, mydoma, sarcoma, 
tetatocatdbaoma, and, in particular, cancm of the adrraial gland, bladder, bone, bone marrow, brain, 
breast, cervix, gall bladd^, ganglia, gastscointestinal tract, heatt, Mdney, Kvct, lung, muscle, ovary, 
15 panoras, parathyroid, pcmis, prostate, salivary glands, skin, spleen, testis, thymus, diyroid, and uterus; a 
devebprnental disorder, sndi as reiial tabular acidosis, ani^iiia, Cndiing's syndrome, adiondroplastic 
dwarfism, Dudbestme and Becker muscular dystrophy, e^epsy, gonadal dysgenesis, WAGR syndrome 
(Wilms* tumor, aniridia, gemfiouiinary abnormalities, and mental retardation), Smith-Magenis syndrome, 
myelodyspkstic syndrome, hereditary muco^itbdial dysplasia, hereditary karatodermas, hereditary 
, 20. neuropathies such as Charcot-Maii&-Too& disease and neurofibromatosis, Iqfpolhyroidism, 
hydrocephahis, seizure disordm sudi as Syndenham's chorea and caiebral palsy, spina bifida, 
anencqphaly, cranioracluschisis, congeiutal glaucoma, cataract, and sensorineural hearing loss; an 
endocrine disord^, such as disorders of the hypolhalamDns and pituitary resulting firom ksions sudi as 
primary bram tumors, adenomas, in£arction associated with pregnancy, hypophysectomy, aneurysms, 
25 vascular malformations, tibrombosis, infections, immunological disord^, and complications due to head 
trauma; disorders associated widi hypopituitarism including hypogpnadism, Sheehan syndrome, 
diabetes insipidus, Kalbnan's disease, Hand-Schulte-Christian disease, Letterer-Siwe disease, 
sarcoidosis, en^ty sella syndrome, and dwarfism; disorders associated with hyperpituitarism including 
acromegaly, gi^ntiCTn, and syndrome of inappropriate antidiuretic hormone (ADH) secretion (SIADJBO 
30 often caused by benign adenoma; disorders associated with hypothyroidism iuchiding goiter, 

myxedema, acute thyroiditis associated with bacterial infection, subacute thyroiditis associated with 
viral infection, autoinmmne thyroiditis (Hashimoto's disease), and cretinism; disorders associated with 
hypex&yroidism including thyrotoxicosis and its various forms. Grave's disease, pretibial myxedema. 
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tone multinDdular goiter, fliyroid caidnoma, and PhinmiBr's disease; disorders associated wife 

hypeiparafliyroidism inctoding Conn disease (caironic hxpercabmia); pancreatic disorders sudi as 

Type I or Type n diabetes meffitus and assodated copaplications; disorders associated with flie 

adrenals soch as hyperplasia, carcinoma, or adenoma of flie adrenal cortex, hypertension associated 

wifli alkalosis, amyloidosis, hypokalemia, Coshing's disease, liddle's syndrome, and Amold-Healy- 

Gordon syndrome, pheochromocytoma tumors, and Addison's disease; disordeis assodated wifli 

gonadal st^oid hormones sudi as: in women, abnormal prolactin production, infertility, endometrioids, 

perturbations of flbe menstraal cycle, polycystic ovarian disease, hyperprolactinMnia, isolated 

gonadotropin defidency, ainetK>rihea,galactDrAea,hermaiflttoditi5ni,liiisiitimia^ 

cancer, and. in post-mcnopaosal women, osteoporosis; and, in men, Leydig cell dc&aeosy, mak 

climacteric phase, and germinal cell aplasia, hypergonadal disoidras assodated wifli Leydig cell 

tumors, androgen resistance assodated wifli absence of androgemeceptois, syndrome of 5 a- 

reductase, and gynecomastia; an eye disorder, sndi as coiguiiclivitis,lE«atoc^ 

kmtitis, ^isclcritis, iritis, posterior uvdtis, ^coma, amauio!^ 

optic neuritis. Leber's heieditaiy optic neiiropa%. 

detadmieait, cataract, macular degoienitioo, ceo^ 

melanoma of flte dioroid, iBtiobulbar turner, and ciiiasmidtimiDr, a mehOx) 

Addison's disease, c<arf>rotendinDns xanfljomatosis. congpmtd 

lesistitiice. cystic fibrosis, diabetes, ftt^hepatodnhosis, fiTictose-l,6-diphDsi*atas6 dcfidemcy, 
galactosemia, gpiter. ghicagonoma, glycogen storage diseases, hereditary ftnctose intolerance, 
lq5Kradieiialism,liypoadreauJism,hyperpar^ 
lQ(peiflisnridism,liypogljcen»a,lqfpofliyro 

lipodystrophies, lysosomal storage diseases, Menkes syndrome, ocdpitaihom syndrome, mannosidosb, 
DBuranmiidase defideaicgr, obesity, pcaitosuria pheiqrlkBtonuria, pseudovitamin D-defidraicy rickets; 

; and agastioinli»tinddisoider,sadi as dyspJwgia, peptic esoplia^ 

strictote. esophageal cardnoma, dyspepsia, mdigestion, gastritis, gastiac carcinoma, anorexia, nausea, 
cmDas, ©BtiopaiBsis, antral or pyloric edema, abdominal angina, pyrosis, gastroeaiteritis. intestinal 
obsliuction. infijctions of die intestinal trad, jiep&i vkei, diolelifliiasis. cholecystitis, cholestasis, 
pancteatilis, pancreatic caidnoma, biliary tract disease, h^atitis, hyperbilinibineinia, hereditary 

) hypefbilirabinania, diApsis, passive congestion of flie Uver, hepatoma, infectious coUtis, ulcerative 
colitis, ulcMativB proctitis, Crohn's disease, Whipple's disease, Mallory-Weiss syndrome, colonic 
caidnoma. colonic obsUaction, irritable bowel syndrome, short bowel syndrome. diarAea. constipation, 
gartrointestinalheanoiriiage, acquired hmironDdefidencysyndroi^ 
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hepatic cBcephalopalhy, hepatorenal syndrome, hepatic steatosis, hemochromatosis, Wilson's disease, 
alphaj-antitrypsin deficiency, Reye's syndrome, primary scferosing cholangitis, liver infarction, portal 
vein obstruction and thromibosis, centrilobular necrosis, peliosis hepatis, hep&lic vein thrombosis, veno- 
occhisive disease, preeclampsia, eclan^ia, acute fatty liver of pregnancy, intrahepatic cholestasis of 
5 pregnancy, and h^atic tumors including nodular hyperplasias, adenomas, and carcinomas. 

hi another emhodiment, a vector capable of expressing DME or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder assocdated wifli decreased 
expression or activity of DME including, but not limited to, those desciibed above. 

hi a further embodim«it, a composition comprising a sobstantialty purified DM£ in conjunction 
10 wilh a suitable phannaceuticd earner may be administered to a subject 

assodated with decreased expression or activil^ of DME innlndrns, but not fimiled to, those provided 
above. 

In still another embodimesit, an agonist which modulates the activity of DME maybe 
adnunist^red to a subject to treat or prevent a disoid^ associated wifli decreased expression or 
1 5 actrvity of DME including, but not limited to, fliose listed above. 

M a fiirdier embodiment, an aittagoiust of DME ntmy be ad^ 
prevQDt a disorder associated with increased esjiression or activi^ of DME. Examples of such 
disoidMs inclnde, but are not licoited to, those antoimmBnne^nflanimatoiy, cell proliferative, 
developiniaital, eaidocrinc, eye, metabohc, and gastraintestiria^ 
20 described above. Inorie aspect, an antibody whkdispedficalfybindsDMEmay be used direct^ 
antagonist or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical ^gent to 
cells or tissues which exinress DME. 

In an ad^liti^^al oooibodimi^ a vector e^ressir^ die cotapkmoA of die polynucleotide 
^icoding DME may he administered to a subject to treat or prevent a disorder associated widx 
25 iiicreasedeq>ressionor activity of DME inc^ding, but iu>t]inQited to, &ose^ 

In other embodimi^its, any of die proteins, antagonists, antibodies, agonists, conj^lementary 
sequences, or vectors of the inv^on may be administered in combination with odier appropriate 
dicrapeutic ag^ts. Selection of the appropriate agents for use in coiribination therapy may be made 
by one of ordinary sldll in the art, according to conveoitional pharmaceutical principles. The 
30 cotribination of therapeutic agents may act synerg^tically to effect the treatment or prevention of the 
various disorders described above. Using tiiis approach, one may be able to achieve dierapcutic 
^cacy with lower dosages of eadt agent, dms reducing the potential for adverse side effects. 
An antagonist of DME niay be produced iising niettiods wMch are generally 
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Itt pardcular, piuified DME may be used to produce antibodies or to screen libraries of pbannaceudcal 
agents to identify tihiose whtcb specificatty bind DME. Antibodies to DME may also be generated 
using methods that arc well known in the art Such antibodies may include, but are not limited to, 
polyclonal, monoclonal, diimMic, and single cham antibodies. Fab firagments, and fragments produced 

5 by a Fab e^qiression library. Neutralizing antibodies (ie., those which inhibit dimer formation) are 
generally preferred for therapeutic use. 

For die production of antibodies, various hosts including goats, rabbits, rats, mice, humans, and 
nf^ptrg Tnny he^ \mn mr\r7£ ic{ by injection widi DME or with any fragmoat or oligopeptide thereof which 
has immunogenic properties. Depending on the host species, various adjuvants may be used to 

10 increase immunological response. Such adjuvants include, but are not limited to, Frennd's, mineral gels 
sudi as aluminum hydroxide, and surface active substances such as lysolecithrn, ploronic polyols, 
polyanions, peptides, oil enmlsions, E3JH, and dinitrophenoL Among adjuvants used iafaumajDis, BCG 
(badHi Calmctfcc-Guerin) and Corvnebacterium parvum are especially preferable. 

It is preferred that the oligppeptides, peptides, or fragments used to induce antibodies to DME 

IS have an amino add sequence cxmsisting of at least about 5 anoino acids, and g^nsrally will consist of at 
least about 10 amino adds. R is also i»efe:tibletihat these ofigopqitides, peptides, or frag^ 
identicd to a portion of tiieannno add sequence of die natural prote^ Short stretches of DME anuno 
adds may be fused with those of anotiier protein, sudi as KLH, and antibodies to die diimmc 
molecule may be prpdaced. 

20 ' Monoclonal antibodies to DMB may be prepared using any tedmique wbidi provides forihe 

production of antibody moleoiles by contimcus cell lines in culture. These include, but are not limited 
to, Ac hybridoma tedmique^ tl» homan B-cellhybridoma tedimque, and die EBV-hybridoma 
tedmique. (See, e.g., Kohte, G. ^ aL (1975) Nature 256:495-497; Kozbor, D. et aL (1985) J. 
finmnnoL Methods 8131-42; Cote, RJ. et aL (1983) PJroc. NatL Acad. Sd. USA 80:2026-2030; and 

25 Cole, SJ». ^ aL (1984) MoL CcflBiol 62:109-120.) 
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D.R. (1991) Proc. NaU Acad. Sd, USA 88:10134-101370 

Antibodies may also be produced by indacmg in vivo production in the lymphocyte population 
or by screearing iimnunoglobulin libraries or panels of highly specific binding reagents as disclosed in 
the hteratore. (See, e.g., Qrkndi, BL et al (1989) Pioc. NatL Acad. Sci. USA 86:3833-3837; Winter, 
5 G.etal. (1991) Nature 349:293-299.) 

Antibody fragments whidi contain specific binding sites for DME may also be generated. For 
exanq>]e, such firagments include, but are not limited to, V(sb\ fi-agments produced by pepsin 
. digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab^2 fragments. Alteanatiwly, Fab expression libraries maybe constructed to aHow rapid and 
10 easy identification of monoclonal Fab fragments with the desired spedfid^. (See, e.g., Ebise, WJ[>» 
et aL (1989) Sdeaice 246:1275-1281.) 

Various hnnmnoassays maybe used for screeaiiiig to identify antibodies having the desired 
spedfidty. Numi^roas protocols for conxpetitive binding or inmmnoradiometric assays using ei&er 
polyclonal or nwnoclonal antibodies witii estab^shed spedfidties are weninovm in the art Sndi 
IS innnunoassays typically involve tiie measurCTieint of complex formation between DME and its specific 
antibody A two-site, monoclonal-based inamnnoassay utilizing monoclonal antibodies reactive to two 
non-ittt^ering DME qxitopes is g^ii^a% used, but a competitive binding assay may also be employed 
(Pound, su^a). 

Various methods sodi as Scatchard analysis in coi^unction vritii radioimmunoassay tedmiques 
20 may be used to assess the affinity of antibodies for DME. Afbuty is expressed as an assodation 
constant, whidi is defined as tiie molar conc^itration of DMB-antibody conqilex divided by 
molar conceiiliations of fi:eeantig^ and £reeantn>odyTOL^^ TheK. 
determined for a preparation of polyclonal antibodies, whidi are heterogeneous in their afBnities for 
multiple DME epitopes, represents the average aftinity, or avidity, of the antibodies for DME. The £^ 
25 detmnined for a preparation of monoclonal antibodies, which are monospecific for a particular DME 
epitope, re^esents a tme measure of affinity. I£^-a£Gnity antibody preparations witii ranging 
fiom about 10^ to 1(P Ltoole aie preferred for use in immunoassays in whidi tiie DME-anlibody 
complex must witiistand rigorous manipulations. Low-afBnity antibody preparations witii E^, ranging 
from about 10* to 10' L/mole are preferred for nse in immnnopurification and sirnDar procedures 
30 which ultimately require dissociation of DME, preferably in active form, from the antibody (Catty, D. 
(1988) Antfhodies , Volume L A Practical Approach , IRL Press, Washington DC; liddeh, J JB, and A. 
Crycar (1991) A Practical Guide to Monoclonal Antibo^es, John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to detmnine 
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die qaaKty and suitability of such jxr^arations for ccxtain downstream supplications. For example, a 
polyclonal antibody prqiaration containing at least 1-2 mg specific antibody/ml, preferdbly 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of DME-antibody 
con^lexes. Procedures for evatnating antibody specificity, tit^, and avidity, and guidelines for 
5 antibody quality and nsagp in various applications, are generally available. (See, e.g.. Catty, supra, and 
Coligan et al supra. ) 

In another ernbodimeoit of the invention, die polynncleotides encoding DME, or any firagment 
or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, 
10 RNA, PNA, or modified oligonucleotides) to Ihe coding or regulatory regions of Ihe gene encoding 
DMB. Su(^ technology is wen known in ^ art, and antisense oU^ 

can be designed from vatioos locations along the coding or control regions of sequences encoding 
DMB. (See, e.g., Agrawal» S., ed (1996) Antisense Hi^peutics^ Bunana Press Inc., Totawa NT,) 
In fbctapexilic use, any gtne deSvay system suitable for introdnction of tiie antiseuse 

15 sequ^ces into sqypropriate target cells can be used. Antisense sequoices can be defivraed 

intracelhilatly in the form of an e3qp!ressi0n plasmid wbidi, upon trausodption, produces a seqn^ice 
■ oomplenGu^itary to at least a portk>n of the celhilar sequence (See,e.g., 
Slater, LB. et aL (1998) J. Allergy CBa ImmunoL 102(3):469-475; and Scanlon, K. J. et aL (1995) 
9(13):1288->1296.) Antisense sequences can also be introduced intracelhilarfy through the use of viral 

20 vectors, sudi as retrovinis and ad^io-assodaled virus vectors. (See> e.g.. Miller, AX). (1990) Blood 
, 76:271; Ausnbel, supra: Uckert, W. and W. WaKher (1994) EhatmacoL Ther. 63{3):323-347-) CMier 
graie defivery merJiauisms include fiposomc-dmved syst^DOs, artificial vind caivclopes, and ofiu^ 
^sterns known m &e art (See, e.g., Rossi, J J. (1995) Br. Med. BuE 51(1)217-225; Boado, R.J. €t 
aL (1998) J. Phann. Sci 87(11):1308-1315; and Morris, M.C et aL (1997) Nuddc Adds Res. 

25 25(14):2730-i736.) 

In anotiier ^nbodtme^ of the invention, potynudeotides encoding DME may be used for 
somatic or germline gpoe therapy, dene dier^y may be performed to (i) correct a genetic delBciency 
(e.g., in the cases of severe combined immnnodefidency (SCID)-X1 disease charactmzed by X- 
linked inheritance (Cavazzana-Calvo, M. et aL (2000) Sd^ice 288:669-672), severe combined 

30 hnmunodefid^y syndrome associated with an inherited adenosine deamin^e (ADA) deficiency 
(Blaese, ILM. et al (1995) Sdence 270:475-480; Bordignon, C et aL (1995) Science 270:470-475), 
cystic fibrosis (Zabner, L et aL (1993) CeH 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Ctystal, R.G. et aL (1995) HimL G^ Th^y 6:667-703), thalassamias, fiomilial 
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hyp^cholesteroleiiiia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal, 
R.G. (1995) Sd^ice 270:404-410; Veima, LM. and N. Soma (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proBfcration), or (iii) egress a protein whidi affords protection against intracellular parasites (e.g., 

5 agaiDst human retro^viruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; PoescMa, E. et al (1996) Proc. Natl. Acad. ScL USA. 93:11395-11399), 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
hrasfliensis : and protozoan parasites such as Plasmo dium feldp anim and Trypanosoma cnm) . In the 
case where a genetic deficiency in DME expression or regulation causes disease, the expression of 

10 DME from an appropriate populatiDn of transduced cells may aOeviafce flie clinical mamfbstalions 
caused by the g^etic defici^K^. 

Id a furtiier emibodiment of ^ invesition, diseases or disoidm caused hy defidendes in DMB 
ate treated by constructing mammalian expressioiL vectois encoding DMB and intrc^dng these 
vectors by medianical means into DMErdefideot cells. Medianical transfer tedmologies for use wi& 

IS cells in vivo or ex ytero mchMte (i) direct DNA microinjection mto individual ceDs, (n) ballistic gold 
particle defiveary, Coii) %>s<mi&-mediated transfe^^ 

use of DNA transposons (Morgm, RA. and W.R Anderson (1993) Annu. Rev. Biochem. 62:191- 
217; Ivies, Z. (1997) Cell 91.501-510; Boulay, J-L and R Redpon (1998) Coir. BiotedmoL 
9:445-450), 

^20 !&qnession vectors tibat maybe elective for the esqxression of DME include, but are not 

Smited to, die K3)NA 3.1, EPITAG, ERCCMV2, PREP, PVAX, PCR2-TOPOTA vectors 
(hivitrogen, Cadsbad CA), PCMV-SCEIPr, PCMV-TAG, PEGSBMmV (Stratagcaic. La Jolla CA), 
and PTBT-OEF, FTET-ON, FrRE2, FrRB2-LUC, PTK-HYG (CaonlBdi, Palo Alto CA). DMB may 
be eqnessed using (i) a constilQtive:^ active promoter, (e.g., from cytomegailovinis (CMV)» R<nis 

25 sarcoma virus (RS V), SV40 virus, thymidine kmase (TK), or P-actin genes), (ii) an inducible promoter 
(e.g., die tetracyclm&-iegalatBd promoter (Goss^ M. and BL Bujard (1992) Proc. NatL Acad, Sd. 
USA 895547-5551; Gossen, M. et aL (1995) Sdraice 268:1766-1769; Rossi, RM.V. and ELM. Blau 
(1998) Coir. Opm. BiotedmoL 9:451-456), commerdally available in the T-REX plasmid (Invitrogen)); 
the ecdysone-inducible promoter (available in the plasmids FVGRXR and PIND; Invitrogen); the 

30 EK506/rapam^ia inducAle prpinotCT; or the RU486/imfepristone inducible promoter (Rossi, P.M. V. 
and Blau, HLM. supral X or (iii) a tissue-spedfic promoter or the native promoter of the endogenous 
gj^ie encoding DMB from a nonnal individnal 

Commi^dalLy available iQxisome transformation Mts (e.g., ftie PERFECT LIPID 
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TRANSFECnON ETT, avaflable from Imdtrogen) allow 

polynucleotides to target cells in calture and require minimal effort to optimize e^)erimeiital 
parameters. In the altemative, transformation is performed using the calcium phosphate method 
(Graham, RL. and AJ. Eb (1973) Virology 52:456-467), or by electroporation (Neumami, E. et aL 

5 (1982) EMBO L 1 :841-845). The introduction of DNA to primary cells requires modification of these 
standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to DME e^qntssion are treated by constructing a retrowns vector consisting of (i) the 
po^l^uclcotidc encoding DME under ^e control of an independent promote or the retromus long 

10 terminal repeat (LTR) promoter, (ii) i^ropriate RNA packaging signals, and (iii) a Rev-responsive 
el^nieat (ESE) along wi& additional tetrovinis d^-acting RNA seqaences and coding sequmces 
required for efBdent vector propagation. Rettoviras vectors (e.g., PPB and PFBNEO) are 
commeixdally avuhble (Stratagpne) and are based on published data (Riviere, L et al. (1995) Proc 
NatL Acad. Sci USA 92:6733-6737), incorporated by refoence herein. The vector is propagated in 

15 an qifTOpriate vector ptodncmgcdl line (VP^ an envelope gene with a tropism for 

receptors on fiie target cells or a promiscaons envelope protein sadi as VSVg (Ann^tano, D. et aL 
(1987) J. Virol 61:1647-1650; Bcndta:, MA. et aL (1987) J. Ytxol 61:1639-1646; Adam, M.A. and 
AJ>. Milfcr (1988) L Virol 62:3802^3806; Dul T. et aL (1998) J. Vnol 72:8463-8471; Znfferey, R et 
al (1998) J. Virol 725873-9880). U.S. Patent Number 5,910,434 to Rigg CMefliod for obtainmg 

20 retrovirus packaging cell lines producing high transducing ef&dency retroviral supernatant") discloses 
a mefliod for obtainmg retroviras paclcaging ceU lines and is 1»^ 

Propagation of retrovinis vectors, transdoction of a population of cells (e.g;., C!D4'^ T-cells), and the 
retnm of transduced cells to a patient are procedures well known to persons skilled in the art of g«ie 
flierapy and have been well documented (Ranga, U. et al (1997) J. Virol 71:7020-7029; Bauer, G. et 

25 al (1997) Blood 89:2259-2267; Bonjaiadi, MX. (1997) L Virol 71:4707-4716; Rang», U. et al (1998) 
Proa Natl Acad. Sd. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In &e altemative, an adenoAdms4>ased gene dacaapy de£very syst»n is used to deliver 
poj^nucleotidcs encoding DME to cells whidi have one or more genetic abnormalities wifli respect to 
die expression of DME. The construction and packaging of adenovirus-based vectors are well known 

30 to diDse widi ordinary ^Dl in the art- Replication defective adenovims vectors have proven to be 
v^satile for importing gqnes encoding iirmmnoregalatory proteins into intact islets in the pancreas 
(C:s^, et al (1995) Transplantation 27*^63-268). Pot^tially usefol adenoviral vectors are 
described in U.S. Fiatent Nurhber 5,707,618 to Armentano (''Adenovirus vectors for gene tiierapy**). 
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hereby incoiporated by leferaice. For adenoviral vectors, see also Aminozzi, PA. et aL (1999) 
Anmi. Rev. Nutr. 19311-544 and Venna, LM. and N. Soniia (1997) Nature 18389:239-242, both 
incorporated by reference h^rdbtt. 

In another alternative, a herpes-based, gpne tberapy deliveiy system is used to deliver 

5 polynncleotides encoding DME to target cells which have one or more genetic abnormalities with 
respect to the e3q>iesdon of DME. The use of herpes simplex virus (HSV>based vectors maybe 
especially valuable for introducing DME to celk of the central n^vons system, for which HSV has a 
tropism The construction and padraging of herpes-based vectors are well known to those with 
ordinary skill in the art A replication-competent herpes simplex virus (HSV) type 1-based vector has 

10 been nsed to deliver a reporter gene to die eyes of primates (liu, X. et al (1999) Exp. Eye Res. 
169385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent Number 5,804,413 to DeLuca ("Herpes simplex virus strains for g^ne transfer"), which is 
hMeby incorporated by refearaace, U.S. Patent Number 5,804,413 teadies die use of recombinant 
HSV d92 wMdi consists of a gpnome containing at least one exogenous gene to be transfixed to a 

15 ceD under die control ofihe appropriate promote for purposes inclnd^ Also 
tau^ by this patent are the constroction and use of xeconobinant HSV strains deleted for ICP4, 
ICP27 and ICP22. For BESV vectors, see also Goms, W.F. ct al (1999) J. 73:519-532 and Xu, 
H. ct at (1994) Dev. Biol 163:152-161, hereby incorporated by reference. The manipulation of 
cloned hrapesvims sequences, the ^sieration of recombinant virus following the transfection of 

20 'mnltq>]e plasmids contaimng differrait segtm^ts of the larg^ herpesvirus genomes, ttie growth and 
propagation of herpesvirus, and flie infection of celb with herpesvirus are tedmiques well known to 
those of oidhiary sldll in die art 

In anoth^ altranative, an idphavims (positive, single-stranded RNA virus) vector is used to 
ddiver polynncleotides encoding DME to tar^ cells. Thebiology of the protolytnc a]^fchaviros, Semliki 

25 FoKsst Virus (SFV), has been studied extensively and graie transfer vectors have been based on the 
SFV gpnome (Garof^ tt and K-L li (1998) Curr. Oirin. Biotechnol 9:464-469). During a^virus 
RNA replication, a subg^mic RNA is gem^rated idiat nonnally encodes the viral c^sid proteins. 
This sub gjsaomic RNA replicates to higher levels dian the foil lengdi genomic RNA, lesulthig in the 
overproducdon of capsid protdns leladve to the viral proteins with enzymatic activity (e.g., protease 

30 and po^merase). Similar!!^, inserting the coding seqneiK^ for DME into the alphav^ 

place of Ihe c^psid-coding region results in die production of a large number of DME^coding RNAs 
and flicqinfliBsisofhi^ levels ofDME in vector transduced ceBs. Whib alphavims infection is 
typically assodated widiceHlyds within a few days, the ability to establish a persistMit infection in 
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hamster Dormal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates lhat the lytic 
replication of al^avLruses can be altered to suit the needs of the gene therapy appEcation OOryga, 
S.A. et al (1997) Virology 228:74-83). The wide host range of alphavirases win allow the 
introduction of DME ruto a variety of cell types. Hie specific transduction of a subset of cells in a 

5 population may require the sorting of cells prior to transduction. The methods of manipulating 
infectious cDNA clones of alphaviruses, performing a^^havirus cDNA and RNA transfbctions, and 
performing alphavirus infections, are well known to those with ordinary sSsjH in the art 

Oligonucleotides derived from &e transcription initiation site, e.g., between about positions -10 
and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can 

10 be adiievcd using triple helix base-pairing methodology. Triple helix pairing is usefiil because it causes 
inhibition of the ability of the double helbc to open snfBcnently for the binding of polymerases, 
tnmscriplion factors, or regulatory molecoles. Rec^ therapeutic advances using triplex DNA have 
been described in tibe Hteratuie. (See, e.g., Gee^ J.B. et aL (1994) in E&iber, B.R and B.L Catr^ 
Motecnlar and Irnnninologic AppgDaches, Fdtora Publishing, Mt Kiaco NY, pp. 163-177.) A 

15 cGmplcmjcntary sequ^ce or antisense molecule may also be designed to block translation of mRNA 
by pareventing the transcript frombinditig to ribosomes, 

• Ribos^yines»enz3f3naticRNAmolBcn]es,'may akobeusedtoca^ 
RNA. Themediankmofriboscynie action involves sequencer 

tnolecule to coni^lcincntaxy target RNAyfoBowed by endonuclcol^ For example, 

20 engjmeered hammratead motif ribozyme molecuks may specifically and efficiently catalyze 

endoimcleolytic cleavage of sequences encoding DMB. 

Specific ribozyme cleavage sites within any potential RNA target are initmOy identified by 

scannuig the taigpt molecab for ribozyme cleavago sites, including ^ following sequotices: GUA« 

GUU,andGUC. OlK»idaittfied,shDrtRNA sequences ofbetwecn IS and 20 ribo^ 
25 corvBsponding to the rc^on of tibe tar;^ geiie contaming tbe deavi^e site, may be evaluated for 

secqndaiy stnictnral features wM(^ may render the oligonnc^ The suitatnlity of 

candidate targ^ may also be evahiated by testing accessibility to hybridization wifii compl^neaitary 

oligonucleotides usit^ ribonuclease protection assays. 

Conxplementary ribonucleic acid molecules and ribozymes of the invention may be pr^ared 
30 by any medipd !kiiown m the art for the synfliesis of nucleic acid molecules. These include tedmiques 

for chemically syndiesiztng oligonucleotides sudi as solid phase phosphoramidite chemical synthesis. 

Ahcmstivciy, RNA molecules may be generated by in vitro and in vivo transcri{^on of DNA 

sequences encoding DME. Such DNA sequoices may be incorporated into a wide vari^ of vectors 
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with suitabb RNA polymerase promoters such as T7 or SP6. Atetnatively, ftiese cDNA constructs 
that syndiesize coii]|>lem^tary SNA, constitutively or mducibly, can be introduced into cell lines, cells, 
or tissues. 

RNA molecules maybe modified to increase intracellular stability and half-1^ Possible 

5 modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3 ' ends 
of the molecule, or the use of phosplK)ro1hioate or T 0-methyl raflier than phosphodiesterase linkages 
within the backbone of die molecule. This concept is inhereait in the production of PNAs and can be 
extended in aH of these molecules by die inclusion of nontraditional bases such as inosine, queosine, 
and wybutodne, as well as acetyl-, meth}^-, duo-, and similartjr modified forms of ademne, cytidme, 

10 guanine, thymine, and uridine which are not as easily recognized by endogenous ^donucleases. 

An additional embodiiDcnt of the invention ^ocon^asses a incdu>d for screening for a 
conqiound which is e^ctive in altiering e^qxresapn of a polynucleotide encoding DMR ConqMHinds 
wludi be efifectiv& in alt^ing esqiression of a specific polynucleotide may include, but are not 
liEiaited to, ofigonucleotides, antisense oligonncleotides, 

15 transcrqition foctors and otiier polypeptide transcdptional regolators, and non-mactomolecular 

diemicaleiitities wbicfa are capdile of interacting with Effix:tive 

, compounds-may alter polymxc]eotid&e3q>res5ion by actfaag as diher inhibitors or^ 

polynucleotide expression. Thas,inthetreatna^of disorders assodated with increased DME 
.e3i|>ression or activity, a conqioiind whicb speci&ally iiihibits expression of the polynucleotide 

20 m»ding DMB may be therapeuijcaOy useful, and in tiic treatment of disorders associated widi 

decreased DMB eiqiressioti or activity, a coxnpound whidispedficaHy promotes esqpression of the • 
polynucleotide encodingDME maybe tiii^^i^iticaLnyiisefuL 

At least one, and up to a plurality, of test compounds maybe screened for eSsctiveness in 
alt^dng expression of a spedficpol^cleotide. A test con^undnoay be obtained by any xneOiod 

25 cammonly known in the art, includiiig dhemical modiBcation of a compound known to he effective in 
alt^cmg polynucleotide expression; selection from an existing, commercia%-avai]able or proprietary 
library of naturaUy-occuning or non-natural cb^cal compounds; rational design of a compound 
based on dioooical and/or structural properties of the target polynucleotide; and selection fit)m a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 

30 pqlyuucbotide encoding DMB is exposed to at ^ast one test compound thus obtained. The sample 
naav comprise, for example, an intact or p^meabilized cell or an in vitro ceD-free or reconstituted 
biodiemical system. Alterations in die expression of a polynucleoti^ encoding DME are assayed by 
any mediod commonly known in the art T^ica%, die expression of a spedfic nucleotide is detected 
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by hybridization with a probe having a inicleotid& sequence complementaiy to the sequence of the 
polynucleotide encodmg DME. The amount of hybridization may be quantified, thus forming die basis 
for a comparison of the expression of the polynucleotide both with and without exposure to one or 
more test compounds. Detection of a change in the expression of a polynucleotide e^sed to a test 

5 con^K>und indicates that the test compound is eflS^ctive in altering the expression of the polynucleotide. 
A screen for a con^round effective in altering expression of a specific polynucleotide can be carried 
out, for exan^k, using a Schizosaccfaaiomvces Tx>nibe gpne expression syst^ (Atkins, D. et aL 
(1999) U.S. Patent No. 5,932,435; Amdt, GM. et aL (2000) Nucleic Acids Res. 28.E15) or ahnman 
cell line sudi as HeLa cell (Clarke, M.L. et aL (2000) Biochem. Biophys. Res. Comnnm. 268:8-13). 

10 A particular OTibodhnent of the preseiKt invention involves scieenfaig a combinatorial libraiy of 

oligonucleotides (sudi as deoxyriboBaideotides, libonndeotides, pqitide nucldc adds, and modified 
oligonucleotides) for antisense ^ittvity ag^nst a spedfic polynucleotide sequence (Bruice, T.W. et bL 
(1997) U.S. Patent No. 5,686^2; Broice, T.W. et aL (2000) U.S. Patent No. 6,022,691). 

Many methods for intiodaciiig vectors into cells or tissues are available and equals suitable 

15 for use in idvD, in vitro. atid csK vivo . Fbr«viw1hBr^, vectors niay be introduced fato 

taken fiom the patient and cbnally propagated for autologous transplant back into that same patient 
Defivety by transfoction, by Iqwsome injections, or by polycat^ . 
using metiiodsvrfridi are well kiM>wn in flie art (See, e.g., Goldman, C.K. et aL (1997) Nat 
Biotedmol 15:462-466.) 

20 Any of the tiu^-apeutic methods descri^ 

sndi therapy, inclodh^, for example, wiairmmlR sudi as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional canbodimcnt of tiie invi^Dn relates to the administratioiL of a conyposition whidi 
g^ralfy conqirises an active iQgcedi^ formulated witii a phatmacenticaHy acce^ble exdpient 
25 ExcipioDte may include, for example, sugars, starches, ceDutoses,^ Various 
fornmMom are oomsnohl^ kiiown and are thoroughly dis^ 

PhnrmiiMi tical Sdcnccs (Mmck PubMapg, Baston PA). Such compositions may coaisist of DMB, 
antibodies to DME, and mimetics» agpnists, antagonists, or inhibitors of DME. 

The con^KJsitions utilized in this invention may be administered by any number of routes 
30 including, but not limited to, oral, intravenous, intramuscular, intra arterial, intramedullary, intrathecal, 
intraventricular, pulmonary, transdeimal, subcutaneous, intraperitoneal, intranasal, enteral, topical,, 
subhngaal, or rectal means. 

Conipositions for puhnonaiy admimstration may be prepared in liquid or dry powder form. 
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These compositions are g^ially aerosoBzcd immediatdy prior to inhalation by the patient In tiie 
case of smaU molecules (e.g. toditional low molecular weight organic drugs), aerosol delivery of fast- 
acting formulations is well-known in the art. In the case of macromolecules (e.g. larger p^tides and 
proteins), recent developments m tiie field of pulmonary delivery via tiie alveolar region of the hmg 

5 have enabled Ihe practical deHv^ of drags such as insulin to blood drculation (see, e.g., Patton, J.S 

et aL, U,S, Patent No. 5,997,848). Puhnonary delivery has the advantage of admmistration wifliont 
needle injection, and obviates tihe need for potentially toxic pen^xation enhancers. 

Compositions suitable for use in the mvention include compositioi^ whermn the active 
ingredients are contained in an effective amonnt to achieve the i^ The detmmnation 

10 of an effective dose is well wilhin tibe capability of fliose sldUed in the art 

Spcdalized forms of compoations may be prepared for direct intiacelhilar deSveiy of 
mactomolecoles comprising DME or fragmesits thereof. For exan^le, liposome preparations 
containing a cellriiiq>eim©ab]e macromolecale may promote cefl fusion and intracelhilar defiv^ of flie 
macromolecijle. AltMiativety,DMEora£ragp(ientthercof nwybejo^^ 
15 teniunalportionfix)mtiiBHIVTal-lptotrin. Fdsionpiotrins fens genaated have beraifonnd to 
transduce into flie cells of all tbsnes» mcfaidmg Ae brain, in amouse model system (SAwaize, SJL ^ 
aL (1999) Sdence 285:1569-1572). 

For any con^raid, thethEarapeuticaDy effective dose can be estimated hutially dfher m cell 
cutaire assays, e.g., of neoplastic ceDs> or in animal models such as mice, rats, rabbits, dogs, monkeys, 
20 orpi^. Ananimalmodelmayalsobensedtodeterminetheapproprialeconcen^ 

lonteofadmmistradon. Sndimfonnation can tihen be used to detennine useful doses and routes for 
adnnnistration in humans. 

A tiierapeotically efiective dose idfars to fhat amount of active ingredient, for example DME 
or fiagments&eieot antibodies of DME, and sagonists, antagpnists or mhibitors of DME, which 
25 amelioiates the symptoms or condition- Therapeutic efficacy and toridly may be detemimed by 
standard phannaceuticsd proceduies in ceh cultures or witii ej^ 

cakolating die ED50 (tiie dose flieriq)euticany effective in 50% of the population) or LD50 (the dose 
lelhalto 50% of the population) statistics. The dose ratio of toxic to ti^rapeutic effects is the 
tiierqieutic index, which can be expressed as die LD^J^so ^tio. Compositions which exhibit large 
30 therapeutic mdices are prefbired. The data obtamed from cell cuHnre assays and animal studies aie 
used to formulate a range of dosage for human use. Tho dosagp contained in sudi compositions is 
preferably witim a range of dicukting coiK:aitrations tiiat mchides the ED50 wifli little or ho toxicity. 
The dosa^ varies within this rapgp depending npon die dosage form ranployed, tiie sensitiviQF of die 
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patient, and fte mute of adiniustr^iL 

The exact dosage wiB be detennined by th& practttioi)^, in Hgbt of factois related to flie 
subject requiiing tteatineiit Dosi^ and adnmiistratioii axe adjusted to provide sof&ci^t levels of the 
active DDM>icty 01 to tnaintainflie desired rffect Factois wUch maybe taken into accoutitiDchide die 

5 severity of the disease state» the general health of the subject, the age, weight, and gender of the 

subject, time and frequency of administration, drag conibination(s), reaction sensitivities, and response 
to tbi^py. Long-acting compositions may be administered every 3 to 4 days, every week, or 
biweekly depending on die half-life and clearance rate of the particular fDrmulation. 

Normal dosage amounts may vary from about 0.1 /ig to 100,000 fz^ up to a total dose of 

10 about 1 gram, d^nding upon the route of administration. Guidam^e as to particular dosages and 
methods of deSvery is prodded in the literature and generally available to praclitioneis in die art 
Those skilled in die art will employ different formulations fDr nucleotides than for proteins or their 
inhibitors. Similarly, deliv^ of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 

15 DIAGNOSTICS 

Lianodier^bodhni^ antibodies wMchspediEicany bind DMB may be used for die diagnosis 
of disorders diaracteized by ^ression of DME, or in assays to monitor pati^its bdng treated widi 
DMBoragonists,antagpn]sts, or inhibitors of DME. Antibodies useM for diagnostic purposes may be 
pr^iared in die same manner as described above for th^peudcs. Diagnostic assays for DME 

20 include mediods which utifize die antibody and a label to detect DME in human body fluids or in 
extracts of eels or tissues. Tbe antibodies inay be used widi or widiontnKM!]ficadoii,a^ 
labeled by ooval^ or non-covahait attadiment of a r^xirter molecule. A wide variety of reporter 
molecules, sevml of wHcb are described above, are known in the art and may be used. 

A variety of protocols for measuring DME, indnding ELISAs, RIAs, and FACS, aie known in 

25 die art and provide a bads for diagnofiiiigait^cd or abiKirnoal levels of DME ex^ Normalor 
standard values for DME expression are establisbed by coml»iiiiig body ^ 
:&Dm normal mammalian subjects, for exanqile, fanxnan subjects, widi antibodies to DME under 
conditions suitable for complex formadoit Tbe amount of standatd complex formation may be 
quantitated by various methods, sudi as photometric means. Quantities of DM£ expressed in subject, 

30 control, and disease sanoples foam biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes die parameters for diagnosing disease. 

In anodier embodim^ of die invention, the polynucleotides encoding DME may be used for 
diagnostic purposes. The polynucleotides whkdi may be used include ohgonuckotide sequ^ces. 
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compleuK^itaiy RNA and DNA molecules, and PNAs. The polyoacleotides may be used to detect 
and quantify gene expression in biopsied tissues in which expression of DME may be correlated with 
disease. The diagnostic assay maybe used to detennine absence, presence, and excess expression of 
DMEy and to monitor regulation of DME levels during therapeutic intervention. 

5 In one aspect, hybridization with PGR probes which are capable of detecting polynucleotide 

sequences, including genomic sequences, encoding DME or closely related molecules maybe used to 
identify nucleic add sequences which encode DME. Thespedfidtyof the probe, whether it is made 
from a hi^ily specific region^ e.g., flic 5'regulatory region, or from a kss specific region, e.g., a 
conserved motif, ai^ flie stringency of die hybridization or amplification will detennine whethor the 

10 probe identifies only naturally occurring sequooces encoding DME, allelic variants, or related 
sequences. 

Probes may also be used for die detection of related sequences, and may have at least 50% 
sequence ideiitity to any of tho DME encoding seqneiices. The hybiidizatton probes of die subject 
inventionmay be DNA or KNA and maybe derived fiom &r sequence of SEQ ID NOt20>38 or from 

IS graionucseqn^ices including ptomoters, enhance 

Means for pxoducmg spcdSc hybxtdization probes for DNAs eiicoding DME include the 
. cfeiiing of polynucleotide sequraices flooding DME or DME derivati^ 
production of miRNA probes. Su(& vectors are known in the art, are cammercial(^avfflla^ 
be used to synfliesize RNA probes in vitro by means of die . addition of die appropriate RNA 

20 :poI)^merases and teappn)pxiate labeled imcleot^ Hybtidizationprobesniaybelabeledbya 
variety of r^rfcer groups, for exan^le, by radionuclides such as ^ or ^» or by enzymatic labels, 
sudi as alkaline ihosphatase coupkd to die probe via aii 

Fo^nndeodde sequeaoces encoding DME may be used for tihe diagnosis of disorders 
associated wiflxei^ression of DME. Exanqylesof sochdisord^indhide,butaren0tliiDitedto^ 

25 autoimmune/Snfiammatory disord^, sodi as acquired irmmmodeficieiicy syndrome (AIDS), Addison's 
disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, am^idosis, anemia, 
asthma, adieroscl^osis, autoimmune hemolytic ani^ma, autoincmnme thyroiditis, autoimmune 
polyi^docnnopadiy-^^andidiasis'-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dffluatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes meHitus, ^physema, ^isodic 

30 lymphopenia with lymphocytotoxms, ^7&roblastosis fetalis, erythema nodosum, ^ophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout. Graves' disease, Hashimoto's thyroiditis, 
hypereosinophifia, irritabk^ bowel syoidrome, moiltiple scl^t>sis, myasd^oua gravis, myocardial or 
pericardial inflammation, osteoarttnitis, osteoporosis, pancreatitis, po^^yositis, psoriasis, Reiter's 
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syndrome, dieunmtoid arthritis, scleiodc^rma, Sjogren's syndrome, systemic anaphylaxis, systemic 
hipas erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uvdtis, Wemer 
syndrotoe, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, 
fnngal, parasitic, protozoal, and helminthic infections, and trauma; a cell proliferative disorder, sudh as 

5 actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue 
disease (MCTD), myelofibrosis, paro^smal nocturnal hemoglobinuria, polycyltemia vera, psoriasis, 
primary thtombocythemia, and cancea:s including adenocarcinoma, leukemia, ^^mphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gjand, bladder, bone, 
bone marrow, brain, breast, cervix, gdl bladdea-, ganglia, gastrointestinal tract, heart, kidney, liver, lung, 

10 muscle, ovary, pancreas, parathyroid, pems, prostate, salivary glands, skin, spleen, testis, thymus, 
thyroid, and uterus; a developmental disoider, such as renal tabular acidosis, anemia, Cashing's 
syndrome, achondroplastic dwarfism, Ducliemie and Becker muscular dystrophy, epilepsy, gonadal 
dysgenesis, WAGR syndrome (Wilms* tumor, aniridia, genitourinaty abnormalities, and mental 
retardation), Snodth-Mag^nis syndrome, myclodysplastic s^drome, herciStary mucoepitiiefial dysplasia, 

IS bcare^tazy keratodennas, Ineditaty nenxopathies sndi as CharcQt-Mari&-Toodi disease and 

ruenrofibtomatoias, hypotibyioidisni, ]iydn>ce|3)ahs, sdznre disorders sudi as Syndetdham's diorea and 
. c^iebral palsy, spina bifida, anenccphaly, cramoraduBdnsis, congenital glaucoma, cataract, and 
sensorineural bearing loss; an ^idocrine disordea*, such as disorders.of the hypotibalamus and pituitary 
resnldng from IbsIods sndi as primary brain tumors, ad^omas, infarction assodated with pregnancy, 

20 hypophysectomy, aneurysms, vascular malfoimations, thrombosis, infections, immunological disorders, 
and compfications due to bead ttaoma; disordss associated with hypopituitarism incbding 
hypogpnadian, 9iedhan syndrome, diabetes instpidus, Kalhnan's disease, Hand-^dnill&r-Oirisrian 
disease, Lettmr-Siwe disease, sarcoidosis, empty sella syndrome, and dwarfism; disorders associated 
widi hypefprtnitarism mft tiding acromegaly, g|anti«gF»^ and syndrome of inappropriate antidiuretic 

25 bonnone (ADH) secretion CSIADH) ofi^ caused by benign ad^oma; disorders associated widi 
bypofliyiDidism including goiter, myxed^na, acute thyroiditis associated with bacterial infection, 
subacute 1]i3rroiditis assodated with viral infection, antormmune thyroiditis (Hashimoto's disease), and 
cretinism; disorders associated with hyperthyroidism including thyrotoxicosis and its various forms. 
Grave's disease, pretibial myxedema, toxic multinodular goiter, thyroid cardnoma, and Plummer's 

30 disease; disorders assodated with hyperparathyroidism including Conn disease (chronic 

bypercabmia); panoreatic disorders sudi as Type I or Type n diabetes melttns and assodated 
complications; disorders assodated widi the ^renals sudi as hyperplasia, cardnoma, or adenoma of 
die adrenal cortex, hypertension assodated widi alkalosis, amyloidosis, hypokaleaua, Cushing's 
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diseMcIjddfo's syndrome, and Amold-Healy-Goidon syndrome, jjieochromocytoma tumors, aad 
Addison's disease; disorders assocaated wWi ^nadal steroid honnones such as: in women, abnormal 
prolactin production, infertility, endometriosis, perturbations of the menstrual cycle, polycystic ovarian 
disease, hypraprolactincmia, isolated gpnadotropba deficiency, amenonJiea, galactorrhea, 
hemsqilirodilism, Misatism and virilization, breast cancer, and. in post-menopausal women, 
osteoporoas; and, in men, Leydig cell defidency, male climacteric phase, and germinal cell aplasia, 
l^p^nadd disordeis assodated wifli Leydig cdl tumors, androgai resistance associated widi 
absence of androg^ rec^tors. syndrome of 5 a-iednctase, and gynecomastia; an eye disorder, sudi 
as conjunctivitis, keratocoiyunctivitis sicca, keratitis, episcleritis, iritis. postMior uveitis, glaucoma, 
amaurosis fugax, ischemic optic neuropafliy, optic neuritis, Lebff's hereditary optic neuropalhy, toxic 
optic neuropathy, vitreous detachment, retinal detachment, cataract, macalar degsaiMation, central 
sMous chorioretinopathy, retinitis pigmentosa, meknoma of fl» dioroid, retrobulbar tumor, and 
chiasmal tumor, a metaboKc disorder, sudi as Addison's disease, ceicbrotcndinoas xanthomatoas. 
congenital adrraml hyperplasia, coamarinreastanw, cgistic fi^^ 

firoctose-1.6-dii4i0sphatase defidcncgr, gptoctosemia, gpitear, gtacagMKHna. g(ycogwi stM^ diseases. 

heaeditary fturtose intolerance, lOT««*w»afis™'^y^^ 
hypoparafliyroidism, Iweichotestawlania, hypedhyrmdism, hypo^ycem^^ 

hypedipidemia. hyperlqiemia. %id myopathics, lipodystrophies, lysosomail storage diseases. Maikes 

syndrome, ocdpatdhoni syndronje, mannoadoas, neairamin^ 

phenynoetamiria, pseadovitamin D-defidoicy rickets; and a gastrointestinal disordea:. sudi as 
dysphagia, pq?tic esoiOiagitis, esi^Jiagpal spasm, esojha^ 
dyspqpsia, indigBsdan, gastritis, gastric carcinoma, anwcsria^ 

pylorir f-df-mff, aHV«"i»al angina, pyrwas. pastroenteiljs. intestinal obstruction, irfectinns of fee 
hdestinal tract, peptic Tdcw, diokOifliiasis, diofccyrtilis. dM)lestasis.pancreatids, pancreatic cardnoma, 
biHaiytnict disease. hBpatiti6,hyp«bffiiiihinBmia, hereditaiyhyperbiliiabinemia, drdiosis, passive 
con^stimi of flie five*. hq>atoma, infectious cofitis, nkeacative coHtis, nfcerative proctitis, Oohn's 
disease, mipple's disease, MaBwy-Weiss qiodrome, cokiric cardnoma, cotonic obstroction, irritable 
bowel syndrome, shortbowel syndrome, dianhea, constipation, gastromtestinal hemonhage, acquired 
immnnodefideaK^ qmdrome (AIDS) eatoropafliy, jaundice, hepatic encq)halopathy, hepatorenal 
I syndron»,hQ)atK: steatosis, hcanochromatosis, Wilson's disease, alphai-antitrypsin defidency, Reye's 
syndioOK, primary sderosing dmlangitis, liver infarction, portal vein obstruction and flmimbosis, 
ceaotribbular necioas, pdiDMshepatis,hq>atic vdn flirombosis. veno-ocdusive disease, preeclampsia, 
edampda, acute fetlylivCT of piegnancy.inlrahepatic d>ol^ 
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iticIadiQg nodular hyperplasias^ adenomas, and carcinomas. The polynucleotide sequences encoding 
DME may be used in Sou&i^ or noithmi analysis, dot blot, or o&er iDembrane4)ased technologies; 
in PGR tedmologies; in dipstick, pin, and multiformat EOSA-Hke assays; and in microarrays utili^ng 
fluids or tissues fit>m patients to detect alb^ed DME esipresdoi^ Such qualitative or quantitatiye 
5 methods are well known in the art 

In a particular aspect, ih& nucleotide sequences encoding DME may be useful in assays ihat 
detect the presence of assodated disorders, particularly those mentioned above. The nucleotide 
sequences encoding DME may be labeled by standard methods and added to a fluid or tissue sample 
from a pati^ under conditions suitable for the formation of hybridization cori[Q>lexes. After a suitable 

10 incubation period, the san^le is washed and die signal is quantified and compared with a standard 
value. If the aniount of signal in the patient san^le is sigoificanttyal^^ 
8aaq>]e ttieai the presence of altered levels of imcleotide sequences encoduig DME in the sample 
indicates die preseaioe of the assodated disorder. Sudi assays may also be used to evaluate die 
efficacy of a particular therapeutic treatment regimen in animal studies* in clinical trials, or to monitor 
.15 the treatment of an indiiddualpatieait 

In ord^ to proidde a bads for the diagnosis of a disorder assodated ^vilh expresdon of DMB^ 
a normal or standard profile for e:;qxi:ession is establi^^ This maybe accompEsbed hy combining 
body fluids or cell extracls taken fiom normal subjects, ddieraniaial or human, widi a sequence, or a 
fragment thereof, encodii^ DME, under conditions suitable for hybridization or an^ilification. 

20- Standard hyfaddization may be quantified by comparii^ die values obtam 

vabes fiom an ^xp&isni^ in which a known amount of a substantially porafied polynucleotide is used. 
Standard values obtained in this mann^ may be cow^pated with values obtaio^ &om samples fix>m 
patents who are symptomatic for a disord^-. Deviation fit>m standard vaihies is nsed to establish the 
presence of a disord^. 

25 OKic» the presence of a disorder is established and a tteatnaent protocol^ 

hybridization assays may be lepcetcd on a regular b asis to determine if the level of expression in the 
patient begins to approximate that which is observed in die normal subject. The results obtained fixim 
successive assays may be used to show die efficacy of treatment over a period ranging fiom several 
days to months. 

30 Wth respect to cancer, the presence of aaabnornial anaount of transcri^^ 

ov^^qxressed) in biopsied tissue fiom an indr^dual may indicate a predisposition for the development 
of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type inaytJlowhealdLprofessioii^ 
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preventative measnres or aggressive treatment Gaxfk&c fheieby preventing the developing or further 
progression of flie cancer. 

Additional diagnostic uses for oHgonacleotides designed from the sequences encoding DME 
may involve the use of PCR. These oligomers may be chemically synthesized, gen^ated 

5 enzymaticaDy, or produced invi&o- Oligomers wiH preferably contain a fragment of a polynucleotide 
encoding DME, or a fragment of a pofyoucleotide complCTaentary to the polytmcleotide encoding 
DME, and win be employed under optimized conditions for id^itification of a specific gene or 
condition. Oligomers may also be employed under less string^ conditions for detection or 
quantification of closely related DNA or RNA sequences. 

10 In a particular aspect, oligonucleotide primers derived fix>m the polynucleotide sequences 

encodii]g DME nmy be used to detect single nucleotide polymoTphisnis(SNPs). SNPsare 
substitutions, insertions and deletions that are a fireqo^t cause of inhOTted or acquired genetic disease 
in humans. Mefliods of SNP detection include, but are not limited to, single-stranded conformation 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide prim^ 

15 dmved from die polynucleotide sequences encoding DME arc 

polymerase dudnieafction (PGR). The DNA may be derived, for example, from diseased or normal 
tissue, biopsy sanqdes, bodily fluids, and flie Hke. SNPs in flie DNA cause difGe^nces in the 
secondary and tertiary structures of PGR products in single-stranded form, and these differences are 
d^ectable using gel electrophoresis in non-d^iaturing gels. infSCCP, the oligonucleotide primers are- 

20 fhiotescentLy labeled, which allows detection of ttie ampfimers inhigh-tiuou^iput equipm^ sudi as. 
DNAseqaeauangmadnnes. Additional^, sequeaice database analysis mel^^ 
(isSNP), aic capable of idend^ing po]|ymoiphisms by compsaiaag &e sequence of individual 
ovedapiMiig DNA fragnients which assein^^ These computer- 

based mediods filt^ out seqaeoce variations due to laboratory preparation of DNA and sequencing 

25 mors using statistical inodeb and automated analyses of DNA sequeiK^dito]^^ Intiie 
altmiatEve, SNPs maybe detected and dbiaracterized by mass spectrom^iy using, for oxainpk>y the 
hi^ tfarou^iput MASSAKRAY system (Sequeaaom, Ina, San Diego CA). 

Methods which may also be used to quantify the e:spression of DME include radiolabeling or 
biottnylating nucleotides, coampMcation of a control nucl^c acid, and interpolating results from 

30 standard curves. (See, e.g.,Melby, P.C et aL (1993) J. Immnnoi Methods 159235-244; Duplaa, C 
et at (1993) AnaL Biodi^ 212229-236.) The speed of quantitation of muMpk samples maybe 
acceJcfatcd by rmming die assay in a Mf^-durou^ipnt fbnnat whi^ the oligomer or polynucleotide of 
int^est is {nesented in various dilutions and a spectrojiSiotometric or colorimetric response gives rapid 
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quantttatioii. 

la fiuthcr dnbodiments, oligonucleotides or longor fragniGnts derived from any of the 
potynucleotide sequences described herein may be used as elements on a microarray. The noicroarray 
can be used in transcriqpt imaging techniques whidi monitor the relative escpiession levels of large 

5 ntunbers of grates simultaneously as described below. The microarray may also be used to identify 
g^^ietic variants, mutations^ and polymorphisms. This information may be used to determine gpne 
function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 
progression/regression of disease as a function of gene expression, and to develop and monitor liie 
activities of &erapeutic agents in the treatment of disease. In particular, tbis information may be used 

10 to develop a pharmacograiomic profile of a pati^t in order to select the most appropriate and effective 
treatment regimeaa for that patienL For exanq>le»dierapeutic agents wMch are highfyefifec^^ 
display ttie fewest side effects maybe selected for a patieait based on bis/her pharmacogenotnic 
profile. 

In anodic ^bodimeiit, DMB, firagmieiits of DME, or antibodies spedfic for DME maybe 

15 nsedaselesDvaxtsonamicmarray. The microairayxnay be used to monitor or measoicpi^^ 
inteiactionsydrug-taiigpt interactions, and g^:iBespre^ 

A particular embodtmient relates to tfie use of the polynucleotides of die present invention to 
generate a transaiptinoage of a tissue or cell type. A transcript imagprqpres^its the global pattern of 
g^e egression by a partLcular tissue or cell type. Global g^e expression pattens are analyzed by 

20 quantifying the nmnb^ of e^qxressed genes and thdr relative abundance xmder given conditions and at 
a giveii time. (See Seilhami^ et al, "Coii4>aiative Transcript Analysis/' U.S. Intent Nmnb^ 
5,840,4M,e3presstyincoiporatedbyrc^reaiceberdiL) Tbus a transcrqit image anaybe generated by 
h>bridimig tine polynucleotides of tiie preseait inv^ranticm or tiieir complements to the totality of 
transcripts or reverse transciqrts of a particular tissue or ceHty^ In one embodiment, tiie 

25 hybridizatifin t^ik^ in bigh-thmn^pnt format, wherein the polynucleotides of the pres^ 
inventionortiiebconqxl^neintscanqitiseasabsef of aplura The 
resultant tram»ipt iniagp would provide a iMX>fiI& of g^ 

Transcript images may be generated using transcripts isolated ftom tissues, cell lines, biopsies, 
or other biological san^iles. Tbe transcript imagp may tiius reflect gene expression iavivo, as in the 

30 case of a tissue or biopsy sample, or in vitro, as in the case of a cell line. 

Transoipt images which profile tiie ejq)ression of the polynucleotides of the present invention 
may also be used in conjunction with in^dtro model systems and preclinical evaluation of 
jAuuXDaoeuticals* as weD as toncolo^cal testnqg of industrial and natoral^-occnrring environmental 
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compounds. All compounds induce diaractenstic g^e expiession pattejms, frequent^ tmood 
molecular fing^tpnuts or toxicant signatures, which arc indicative of mechanisms of action and toxicity 
(Nuwaysir, E-F. et aL (1999) Mol Cardnog. 24:153-159; Steiner, S. and NX. Anderson (2000) 
ToxicoL Lett 112-113:467-471, expressly incorporated by reference herdm). If a test compound has a 
5 signature similar to that of a compound with kaown toxicity, it is likely to share those toxic properties. 
These fingerprints or signatures are most useM and refined when they contain expression information 
from a large number of g^ies and gene families. Ideally, a genome-wide measurement of e:?qiression 
provides &ehi^)est quality signature. Even genes whose expression is not altered by any tested 
compounds are important as well, as the levels of expression of these genes are used to normalize the 

10 lest of ihe egression data. The iKmnalization procedure is useiul for comparison of expression data 
after treatment with dif^^nt compounds. While the assignment of gpne funcdon to elements of a 
toxicant signatoie aids in mfcrapielalion of toxidly medianisms, Imowledge of gene fimcdon is not 
necessary for tihe statistical matchiug of sigpatuics which leads to predictian of toxicity. (See,for 
exanqtb. Press Release 00-02 from tfie National Institnte of Environmental Health Sciences, released 

15 February 29, 2000, avmlable athttp://www.nidisj]ilLgQy/ocA}ews/tox€^.fatnL) Therefore, it is 
important and de^rabk in toxicological screening using toxicant signatures to include all expressed 
geaie sequences* 

. In 0!ne^nbodinasnt» die toxicfly of a test compound is assessed by tr^^ 
cont^ning nucleic acids with die test compound. Nucldc adds that are e::q>ressed in Ihe treated 
20 biolo^cal sanq>le are hybridized wifli one or more probes specific to the pol^ucleoti^ of the present 
invention, so fiiat transcr^fevels corresponding to die polynucleotides of the present invitation maybe 
quauti&d. IbetcanscKqptlevc^ in the treated biobgicalsan^^ 

untreated biological san^le. Differ^Kses in the transci^levds between die two samples ate 
indicative of a toxic response caused by die test con^Kiund in the treated sample^ 

25 Another particular ^bodiment relates to the use of the polypeptide sequ^ices of the present 

invention to analyze the pxoteome of a tissue or cell type. The term proteomereii^ to the global 
* pattern of protein expression hi a particular tissue or cell type. Each protein component of a proteome 
can be subjected individually to lurflier analysis. Proteome expression patterns, or profiles, are 
analyzed by quantifying flie nuinb«c of e:q)ressed proteins and their relative abundance under given 

30 conditions and at a given time. A prpfife of a cell's proteome may thus be generated by separating 
and analyzing tibepol^^ptides of a particular tissue or cell type. In one embodiment, die separation is 
adiieved using two-dimensional gel electrophoresis, in whidi proteins from a sample are separated by 
isoelectric focasing in the first dimi^i^on, and thea according to noofecular wd^t by sodium dodecyl 
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snlfete slab gel efecttophoresis in tbe second dimension (Steiner and Anderson, su^. The proteins 
are -visualized in flie gel as aisaete and uniquely positioned spots, typicaEy by staining the gel with an 
agoit such as Coomassie Blue or alver or fluorescent stains. The optical daisity of each protein spot 
is generaUy proportional to the lewl of fee protean in Ihe san^le. The optical densities of equivalently 

5 positioned protein spots from different saiiq)les, for example, from biological samples either treated or 
untreated with a test compound or flierapcutic agent, are compared to identify any changes in protein 
spot density related to Ihe trcabncait The proteins m flw spots are partially sequenced using, for 
example, standard methods enq»loying chemical or enzymatic cleavage folbwed by mass 
spectrometry. The identity of the proton in a spot may be determined by comparing its partial 

10 sequence, preferabty of at least 5 contiguous amino add residues, to fte polypeptide sequences of Ak 
present invcsution. Ih some cases, fiutosequesnce data may be olrtain<d forded 
idmtificatioa 

A proteomic profife may also be generated iiang antibodies spedfic for DME to quantify flie 
levels of DME e3q»rBSsioiL In one embodimeat, fte antibodies are 

15 andprotcanc3q«ic5sianlevdsarequantifBdby«sjj^^ 

flw fevds of pnifceinbonnd to eadi array dement CLuddng. A. et d. (1999) And. Biedan. 270:103- 
111; MMidoze,L.G.etaL (1999) Biotedmiques 27:778-788). Detection may be psrfarmed by a 
variety of metiiods known in flie art. for raampte, by reacting flie protdns in flie sanqile wifli a fliiol- or 
amino-reactive ftooiesaat om^KHmd and detecting a»e anc^^ 

20 efcanent 

Toxicant signatmes at die proteome tevd are also useful for toxtcological scieemng, and 
diodd be analyzed in pardldwiflitDMcaptdgnattires at the t^ Thaeisapoor 
oondationbetwe«itransaipt and protein dnmdances for some protdns in sometiaaies (Andason, 
NX. and J. Sdnwtner (1997) BbctiBphoieas 18-533-537), so proteome toxicant agnatures may be 

25 usefblintiKandydEofconapoundsii*ididonotsigdficanfl^ 

dt» tiie piolecmiic profile. Li adtftion, flie analyas of transcripts in body fluids is difficult 
degcadatitm of mRNA, so proteomic profiling may be more reliable and mformative in such cases. 

In anofljer axSoodaatisiA, flie toadly of a test conqwund is assessed by treating a biologicd 
san^fcocmtaiimg protdns wifli flie test compound. Proteins fliat arc expressed in flie treated 

30 biologicd sample are separated so fliat the amount of eadi protein can be quantified. The amount of 
eadi iwotein is compared to flie amount of flie corresponding protein in an untreated biologicd sanqde. 
A diffaence in flie amount of piotdnbetwerai flie two samples is indicative of a toxic response to flie 
l«5st confound in flic treated saniplB. Individnd protdns are identified by scquraidngflie amino add 
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residues of tlie individual proteins and con^aiing these partial sequences to tbe polypeptides of die 
present inTentiojQ. 

In another anbodiment, the toxicity of a test com^und is assessed by treating a biological 
sample containing proteitis with the test conoqpound. Proteins fix)ni die biological sample arc incubated 
5 wifti antibodies specific to die polypeptides of the present invention. Ihe amount of protein recognized 
by flic antibodies is qaandfied. The amount of protein in the treated biologica] sample is compared 
wilh die amomit in an untreated biological sample. A dijBisrence in the ainountofprotein between die 
two san^les is mdkative of a toxb respoi^ to flbc test conxpound in the treated sample. 

Microarrays may be prepared^ used, and analysed usmg methods bown in the art (See, e.g.y 
10 Biennan. TM. et aL (1995) U.S. Patent No. 5,474,796; Sch^ M. et aL (1996) Proc. NatL Acad, 
Sd. USA 93:10614-10619; BaldeschwdlEa: et al (1995) PCT application W095/251116; Shakm, D. et 
aL (1995) PCT application WQ95/35505; Heflear, R.A. et aL (1997) Proc NatL Acad. ScL USA 
94-^150-2155; and Bfelfcar, MJ. et aL (1997) U.S. Patent No. 5»605,662.) Various types of 
microarrays are woU known and thorou^dy described in PNAMicroarravs: A Practical ApptoadL 
. 15 M. Schena, ed. (1999) Oxford IMvi^tyPress^ London, b^y es^oesslyincoipojratedby jrefctenc^ 

M another embodiment of ttie inmition, nncldc acid sequences (»icoding DMB may be used 
to geneiratB hybridization probes nsefhl in mappiDg the. natural^ occurring gnomic sequ^ce. Eiflier 
cf>dingornDncoding seqaetK:6S]iiay be used^andinsonie instances 

preii^able over codtug sequences. For exanqiley conservation of a coding sequence among members 
20 of a muM-gene family may potential!^ cause undesired cross hybridization during chromosomal 
mapping. The sequences maybe mapped to a particular chiom^ 

duomosome, or to artifidal chromosome constractiDns, e.g., human artificial duomosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial Ghromosames (BACs)» bac^al PI 
constructions, or singje chromosome cDNA Hbraties. (See, e.g., Harrington, J J. et aL (1997) NaL 

25 15345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B J. (1991) Trends Genet 

7:149-154.) Once mapped, the nucleic add sequences of the invention may be used to develop 
genetic linkage maps, for example, wMdi correlate flie inheritance of a disease state wifli die 
inheritance of a particular chromosome region or restriction fragment lengfli polymorphism (RFLF). 
(See, for example. Lander, E.S, and D. Botstein (1986) Proc, Nad. Acad. Sci. USA 83:7353-73570 

30 Fluorescent in situ hybridization (FISH) maybe correlated with other i^ysical and genetic 

map data. (See, e.g,, Heinz-Ukich, et aL (1995) in Meyers, supra, pp. 965-968.) Examples of g^ietic 
map data can be found in varbus scientific journals or at die QnBne Mendelian Inkdtance in Man 
(OMIM) World Wide Web she. Correlation between (he location of tiie gene encoding DMB on a 
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l^ysical and a spedfic disorder, or a predisposition to a specific disorder, may he^ define the 
region of DNA associated with that disorder and thus may ftirth^ positional cloning efforts. 

In situ hybridization of chromosomal preparations and pli^cal mapping techniques, such as 
linkage analysis using estabKdied dironoosomal markers, may be used for extending genetic maps. 

5 Often the placement of a giMie on tiie chromosome of another mannnalian species, such as mouse, 
may reveal associated maikers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes uiang positional cloning or other gene discoveiy 
techniques. Once the gpne or genes responsible for a disease or syndrome have been crudely 
localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, any 

10 sequences m^ing to that area may represent associated or re^atory genes for further investigation. 
(See^ e.g., Gatti, ILA. et aL (1988) Nature 336:577-580.) The nudeotide sequence of flie instant 
invention may also be used to detect diffexeiK^es in the chromosonial location due to translocation, 
inversion, etc*» among normal» canti^, or afifected individoalB. 

In anofter embodimeDt of die invention, DMB, its catalytic or immnnog^c firagmcmts , or 

15 oligopeptides thereof can be used for screeoiiiglibratbs of 

scieeiiing tBchmques. The fragment esn^loyed in snch screening may be fi^i^ 

solid support, borne on a cell surface^ or located intraceBalarly. The formation of binding com|>lexes 

betwec^DME and tiie agpnt being tested may be measured-. 

Another tecfamque for dmg sa:eQD]ng provides for bigh Ihrou^^t screening of compounds 

20 ImvingsuitablcbindmgafBnilytotiieprDteinof inteirst (See,&^^ 

f^]icatianW084/Q3564.) In this mediod, large nniid>ai:s. of differ^ sniaU test com 
synfi^ized on a solid substrate. The test compounds are reacted with DMB, or ficagmraits thereof, 
andwadied. Bound DMB is detected by mediods well known in the art Purified DME can also 
be coated directly onto plates for use in tl)o aforementioned drug scieeii^ Altemativety^ 

2S nonrniEmtrailizjng antibodies can be used to capture die peptide and immobilize it on a solid support 
in anoHiBa: embodiment, one may use competitrve drug scae^omg assays in wMdi neutralizing 
antibodies capsibleofbinding DMB spedfbalycoinpetewiaL a test ^ M 
tiiis manner, antibodies can be used to detect the presence of any peptide which shares one or more 
antig^c determinants with DMB. 

30 In additional embodinients, the nucleotide sequences wM^ 

molecular biology techniques tiiathave yet to be developed, provided tiie new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to» such 
prop^es as tiie tr^l^ genetb code and spedfic base pair intra:^ 
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WtiiDTit further elaboration, it is b eKeved that one skflled in tbe art can, using the preceding 
description, utilize flie present inyoition to its fdDest extent Hie following embodSmoats are, therefore, 
to be constnied as merely ilhistrative, and not limitative of the remainder of the disclosore in any way 
whatsoever. 

5 The disclosnres of all patmts, applications and publications, mentioned above and below, 

incudingU.S. Ser. No. 60/223.055, U.S. Ser, No. 60/224,728, U.S. Ser. No.60/226,440, U.S. Ser. 
No.60/228,067, U.S. Ser. No.60/230,063, U.S. Ser. No.60/232,244, and U.S. Ser. No.60/234,269, are 
e3q)iessly incorporated by reference herein. 

10 

EXAMPLES 

L Construction of cDNA libraries 

Incyte cDNAs were derived ftom cDNA Hhtaries described in Ihe IJFESBQ GOLD 
datd>aseCIhcyte Genomics, Pab Alto C^) and diown in Some tisanes wore 

IS homogenized and lysed in gaanidiniam isothiocyanate, whilo odiets were homogenized and lysed in 
phenol or in a suitable mixtiDre of denatnrants, sucai as TRIZOL (Life TechnolDgies), a monopbadc 
solution of phenol and gaanidineisolfaiocyanate. Ihe resulting lysates were centrifoged over CsQ 
cushions or extracted with ddorofbmL lO^A was predfritatedfironi the lysates with dtheriso^panol 
or sodium acetate and ethanoU or by other routine notdhods. 

20 Phenol extraction and ptedpitation of R^lAwem repeated as nec^ 

puriQ^. In some cases, RNA was treated with DNase. F6rmostlibxaries,poly(A)4-RNA was 
isolated nsmg ohgp d(I>cou{ded paramaguelic particles (Promega), OLIGOTBX latex particles 
(QIAGEN.Oiatswor&CAXoranOUGOTEXmRNApurificadonK^ Alternatively, 
RNA was isolated drectfy ftom tissue lysates using od]» IQ^ isok^ 

25 POLY(A)PUR£nQRNApu]ificationlit(Anabion,An^ 

In some cases, Stratag^ was provided with SNA and constructed Ihe corresponding cDNA 
libraries. O&erwise, cDNA was synthesized and cDNA libraries wer^ 

UMZAP vector systxan (Stratag^) or SUPERSCRIPT plasmid systean (Life Technologies) , using 
the reconun^ided procedures or similar methods known in the art (See, e.g., Ausubel, 1997, supra, 
30 units 5. 1-6.6.) Reverse transcription was initiate using oligo d(T) or random primers. Synthetic 
oligoixttcleotide adapters w^ fignted to double stranded cDNA, and the cDNA was digested with die 
q>propriate restriction enzyme or enzymes. For most fihraries, the cDNA was size-selected (300- 
1000 bp) using SEPHAOIYL SIOOO, SEPHAROSB CL2©, or SEPH/^ 
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chromatography (Ameisham Phannada BiotecJi) or preparative agarose gel ekctrophoreas. cDNAs 
were ligated into compatible restriction enzyme sites of the polyfinko- of a suitable plasmid, e.g., 
PBLUESCRIPT plasmid (Stratagene), PSPORTl plasmid (Life Technologies). PGDNA2.1 plasmid 
(lavitrogen, Carlsbad CA), PBK-CMV plasmid (Stratag^e), PCR2-TOPOTA (fimtrogen), or 
5 pINCT(Incyte Genomics, Palo Alto CA), or dravativest^^ Recombinant plasmids were 
transformed into competent E>coli ceDs including XLl-Blue, XLl-BlicMRF, or SOLR from 
Stratagene or Dma, DHIOB. or Elw^MAXDmOB from lifc Tcc^ 

n. Isolation of cDNA Clones 

Flasmids obtained as descdbed in Esatx^le I were recovered from host cells by in vivo 

10 e3Kdsionnsii^fceXJNIZAPvBctorsystE«n(Stratag^)orby^^ Plasmids were pmifiednsing 
at least one of Hbo foHowing: a Magic or WIZARD Rfiniprcps DNA purification system (Promcga); an 
ACTC Miniprep punficatiDn kit (Edgp Biosystoois, Gaifiiersbnrg MD); and QIAWELL 8 Plasmid, 
QIAWELL 8 Phis Flasnud, QIAWELL 8 Ultra Plasmid paiificatLon systems or fiie JLE. AJL PREP 
96 plasmid purificadonldt from QIAGEN. Following pred^tadon, plasmids were resuspended in 0.1 

15 ml of distilkd watcar and stored, with or wi&out lyox^nlization, at 4 ''C 

. Altemalivdly, plasmid DNA was amplified from host cdl lysates usi^ 
M^i-throu^t fijrmat (Rao, V.B. (1994) AnaL Biodiem. 216:1-14). Host cell lyris and thermal 
cycling steps WCTe carried ont in a singb reaction mixhirei Samples were processed and stored in 
384-weIl plates* and the concentration of amplified plasmid DNA was quantified finorometiica]^ using 

20 PIC€KSREBN dye (Molecular Probes, Bag^ OR) and a FLUOROSES 
(Liibsystems ()y, Helsiiild, 1%^^ 
HL Sequencing and Analysis 

Incyte cDNA recoveared in plasmids as described in Exanqile E w&x> sequenced as fijllows. 
Sequcmdng reactions were processed using standard mediods or highL-ftroughput instromentation sudi 

25 as the ABI CATALYST 800 (Applied Biosystems) thmnal cyclca: or fb& FrC-200 fliemial cycler 
(MJ Reseaidi) in conjunction with the EWDRA ndcTodispenser (Robbie 
MICROI^ 2200 (Hamilton) fiquid transit cDNA sequwidng reactions were prqpared 

uang reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequendng kits such as 
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 

30 Electrophoietic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
w^ carried out using tlie MEGABACE 1000 DNA sequencing system (Molecular Dynamics); flie 
ABI PRISM 373 or 377 sequencing system (AppHed Biosystems) in conjunction with standard ABI 
protocob and base calling software; or odiersequMK»analy^ Readmg 
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frames within the cDNA sequMKies were ideiitified usntg standard methods (reviewed in Ausubcl, 
1997, snpra. unit 7.7). Some of the cDNA sequences were selected for ext^ision using the 
tedmiques disclosed in Exan^le VIEL 

The polynucleotide sequences derived from Incyte cDNAs were validated by lemoving 

5 vector, Mcer, and poly(A) sequeaices and hy masking ambiguous bases, using algorithms and 

programs based on BLAST, dynamic progranmring, and dinucleotide neatest neighbor analysis. The 
Incyte cDNA sequences or translations tii«ieaf w«e dieii queried against a selection of public 
databases such as the GenBank primate, rodent, maTntnalian, vertebrate, and eukaryote databases, and 
BLOCKS, PRINTS, DOMO, PRODOM, and Mddeai Ma&ov model (HMM>-ba5ed protdn family 

10 databases such as FFAM. (HNfM is a probabilistic attach wMch analyzes oo 

stnictmes of gotic &ni£es. See» for exanq>le, Edity, S.R. (1996) Corr. Opin. Struct Biol 6361-363.) 
The qn^es wi»e peffotmed using programs based on BLAST, PASTA, BUMFS, and HMMER. 
The Incyte cDNA sequences were assembled to produce fall lengfli polynucleotide sequences. 
Alteinadvely, OenBank cDNAs, GenBank ESTs» sdtdbed sequences, stretched seqaooces, or 

1 5 Genscan-predicted coding sequ^aces (see Examples IV and V) used to cxtsoA. Incyte cDNA 
assemblages to fan l0DgflL Assembly was performed using programs based on Hued, Fhrap, and 
Consed, and cDNA assemblagos were scre^ied for open reading frames using programs based on 
GteneMark, BLAST, and FASTA. The falfa^ polynucleotide sequ^ces were Irandated to derive 
the corresponding-fan Ira^glh polypeptide sequences. Alteniatrv6ly» a polypeptide of the invention may 

20 begmatanyofthemednonine residues of fliefaniengditranslatedpoj^pq^ Pbnkngtb polypeptide 
sequences we^e subsequently analyzed by querying against databases such as die GenBank protein 
databases (gpiqwpt). SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosit©, and hidden 
Mad3oymodd(HMM>4Mised protein family databa^ Rdllengdi polynucleotide 

sequences are also analyzed using MACDNASIS FRO software (Hitadii Software Engmeering, 

25 South San RrandscoCA) and LASERGENE software (DNASTAR). Polynucleotide and pol^eptide 
sequence alignments are generated using default paramet^ specified by the CLUSTAL algorithm as 
incorporated into tiie MEGALIGN mnltisequeQce alignment jffogram (DNASTAR), whidh also 
calculates the percent identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 

30 Incyte cDNA and ftdl kngth sequences and provides applicable descriptions, references, and threshold 
parameters. The furstcolimim of Table? diows^ took, prograrns, and algorifhio^ 
column provides brirf descriptions thereof, the third column presents t^xopriate references, att of 
which are incorporated by reference hmin in their entirety, and the fourth cohmm presents, where 



90 



wo 02/12467 



PCT/DSOl/24382 



s^Iicable, the scotes, probabOti^ valaes, and o^r paratxieters used to evahiate the strengdi of a 
match betwe^ two seqaences (the lu^r tiie scoie or the lower probability value, die gceat^ the 
id^atity between two seqa^oces). 

The programs described above for the assembly and analysis of fuQ kngthpoIyuuckotLde and 

S polyp^de seqa^ces were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:20-38. Fragm^its fiom about 20 to aboat 4000 nucleotides which are useful in hybridization and 
auj^lificodon tedbnolo^es are described in Table 4» cohinn 4. 
IV. Identification and Editing of Coding Sequences from Genomic DNA 

Putative dmg metsibolizi^g en2ymBs wese initially identified by numing the Genscan 

10 identification program against public g^mic seqa^ce databases (e.g., gbpri and gbhtg). G^iscan is 
a gesttend-purpose gene identification program wUdi analyzes gnomic DNA sequsoces £rom a 
variety of organisms (See Burge> C and S. KmSn. (1997) L Mol BioL 268:78^94. and Surge, C. and 
S. Karlin (1998) Curr. Opin. Struct BiaL 8:346-354). The program concatenates predicted exons to 
form an assembled cDNAsequ^ioe extending fiom a methionine to a stop codon. Theou^tof 

15 (Smcan is a FASTA database of polynucleotide and polypeptide sequences. The maximum range of 
sequence for Grenscan to analyze at once was set to 30 kb. To deterrmne which, of these Genscan 
predicted cDNA sequeiK^s encode drug metabolizing enzymes, the encoded polypeptides were 
analyzed by querying against PFAM models for drug metaboKzmg Potential drug 

metabolizii]^ ^szymes were also identijSed by homology to Incyte cDNA sequences that had been 

20 annotated as drug metabolizing enzymes. These selected Genscan-predicted sequences were then 
con^iared by BLAST ana^s to die g^ipept and gbpri pub£c databases. Where necessary^ the 
Gienscan-^predicted sequences were then edited by com^amon to die top BLAST hit from gienpept to 
collect errors in die sequetK^e predicted by (j^iscan, such as extr& or omitted exons. BLAST 
ansHysis was also used to find any Incyte c£>NA or public cDN A co vi^ragp of the Genscan-predicted 

2S sequences* dius providing evidence for transcdption. "Whenhu^ cDNA coverage was available, 
dns information was used to correct or confirm the Genscan predicted se^ Rdllengdi 
polyancleolide sequences wete obtained by assernbling Genscao-ptedicted coding sequences widi 
hacyte cDNA sequences and/or public cDNA sequ^ices usix^ fte asseoibVpi^K:^ desaibed in 
Esample 10. Aheanativdy, foil lenglh polynucfeotide sequences were dmved endrelbr from edited or 

30 unedited Gmiscan-predicted coding sequKsces. 

V* Assembly of C}enomicSequ»«:e Data with cDNASeqi^ 
"Stitched^ Semienccs 

Partial cDNA seqaences were eatoided widi exons pcedictedby die Genscan g^ 
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identification program desmbed in Example IV. Partial cDNAs assembled as described in Example 
in weaie inapped to g^ioiDic DNA and parsed into clasters contaio^ 

exon predictions &om one or more genomic seqaences. Each cluster was analyzed using an a]goiitfam 
based on gr^ili theory and dynamic programming to integrate cDNA and genomic information, 

5 generating possible splice variants that were subsequently confirmed, edited, or extended to create a 
fdH length sequence. Sequence intervals in which the entire leogth of the interval was present on 
more dian one sequence in the cluster were identified, and mtervals thus identified were considered to 
be equivalent by transitivity. For example, if an interval was present on a cDNA and two genomic 
sequences, tiben all tibree intervals were considered to be equivalent This process aDows unrelated 

10 but consecutive genomic sequences to be brought together, bridged by cDNA sequence, hitervak 
thus identified were ti^n "stitdbed'' togpth^ by the stitching algorithm in tibe order that they appear 
along their parent sequences to generate &e longest possible sequence, as well as sequence variants. 
Linkages between inte^irals which proceed along one type of parcmt sequence (cDNA to cDNA or 
genomic sequence to genomic sequence) were ^ven pretence ovor linkages which change par^ 

15 type (cDNA to genomic sequoQce). The lesuhant stitched seqneiices were tratislatB^ 

by BLAST analysis to ttie genpept and gbpri public databases. Jxncomct exons predicted by Genscan 
were collected by comparison to the top BI^ST hit from genpept Sequences were fiu&er extended - 
with additional cDNA scqa«K:es, or by inspection of getiomic DNA, when necessary. 
**StrctcheiF Sequences 

20 Partial DNA sequences were extended to fan lengfli with an algoritimi based on BI^ 

analysis. Firsts partial cDNAs assembled as described in Bxample HI weie queried against public 
databases such as the GenBahk primate, rod^it, mammalian, vertebrate, and eukaiyote databases 
using fte BLAST program Ihe nearest GenBaiik protein horiiolog was then coriqmred by BLAST 
analysis to diher Incyte cDNA sequ^ices or GenScanexon predicted sequences described in 

25 ExHtnplelV. A (diimj^ic protein was g^oefated by using the resultant Mg^^ 

(BSFis) to map tiie translated sequ^ices onto the GenBank prot^ homolog. Insertions or deletions 
nwy occur in the chirnericprot^wi& respect to the original Oe^ The 
GenBank jiroteni homolog, the chimeric protein, or botii vfGt& used as probes to search for homologous 
genomic sequences from the public human genome databases. Partial DNA sequences were 

30 thmfore ^^stretched" or ext^ided by the addition of homologpus genomic sequences. Ihe resultant 
stretched sequences were exarmned to detmnine whetii^ it contained a conqylete g^. 
VI. ChromoscMiial Mapping of DME Encoding Polynucleotides 

The seqoeaices which used to ass^nbk SEQ ID NO:2(>-3 8 were compared vidth 
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seqaeiices from flie Incyte DFESEQ database and public domain databases using BLAST and otiher 
implanentations of the Smifli-Watennan algoiidmL Sequences from fliese databases that matched 
SEQ ID NO:20-38 wa» assembled into clusters of contiguous and ovi^lapping sequences using 
assembly algortthms snch as Phrap (Table 7). Radiation hybrid and genetic mapping data avaflable 

5 from public resources such as the Stanfoid Human Genome Center (SHGQ 

Genome Reseaidh (WIGR), and Gen6thon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of an sec[U«ices of that cluster, including its particular SEQ ID NO:, to that miap location. 

Map locations are represented by ranges, or intervals, of hnman chromosomes. The map 

10 position of an interval, in centiMorgans, is measured relative to the t^minus of the chromosorDe*s p~ 
arm. (The c^tiMorgan (cM) is a unit of measurement based on recombination frequencies betweeai 
chromosomal markers. On average, 1 cMisron^ilyequivaleiittD 1 megabase(Mb)of DNAia 
lininans,alfhou^thiscanvary widely due to hot amd cold spots of I The cM distances 

are based on gj&netic markers mapped by GSnSbon which provide boundaries for radiation hybrid 

15 markMS whose sequences were inctoded in each of fliedust^. Qumangpnomoim^ and other 
resources available to the pnbHc, sudi as the NCBI "GcneM^ 

Qi^'f^mxrxK nr hi tilm nlh gov/gftTiem^, can be employed to det^rmme if previously identified disease 
g^ies inap wifliin or in proxhnrty to the intervab indicated ato 
Vn. Analy^ of Ptolynudeotide Expression 
• 20 Nordimi analysis is a laboratory tedmique used 1o detect the 

gene and involves the bybxidizatian of a labeled nucleotide sequence to a membrane on whidi RNAs 
fri>mapaiticolarceniypeQrtissaehavBbe»boatid (See,e.g.»Sanibrook»snQm,di.7; Ausubel 
(1995) supra, ch. 4 and 16.) 

Anabgons computer tecbniques a^lymg BLAST were used to seardi for id^cal or related 
25 inolecules in cDNA databases sadhi as GenBarik or IJF^ Bus analysis is 

imichfristerftaniiiukipleinenaibrane^iased hybridize In addition, Ac sensitivity of die compute 
seaidi can be modified to determine wheSi^ any particular match is categorized as exact or similar. 
Hie basis of flie seardi is die product score, which is defined as: 

30 - BLAST Score xPercmt Identity 

5 xmhuranm {leaigfli(Seq- 1), lengfti(Seq. 2)} 

The product score takes into accoimtboth die degtee of similarity betwera two sequences and the 
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l^igth of flic sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 
product is divided by (5 times the length of the shorts of the two sequences). The BLAST score is 
calculated by assigning a score of +5 for ev^ base that matches in a highhscoring segment pair 

5 (IBP), and-4forevCTymismatdbu Two sequences may share more than one HSP (separated by 
gaps). If fliOTJ is more than one HSP, thai the pair witih the highest BLAST score is used to calculate 
the product score. The product score r^iresCTts a balance between fractional overlap and quality in a 
BLAST alignment For example, a product score of 100 is produced only fox 100% identity ova: the 
entire length of the shorter of the two sequences being compared. A product score of 70 is produced 

10 dther by 100% identity and 70% overlap at one end, or by 88% identity and 100% ovcdap at the 

other. A product score of 50 is produced either by 100% identity and 50% ovedap at one end, or 79% 
identic and 100% overly. 

Altexnativ&ly, polynucleotide sequences encoding DMB are analyzed with respect to tibe tissue 
sources fiom which they were dmvBd- For exanqple, some full length sequences are assembled, at 

15 least in part, widioviEarhi^inglnc^dDI^ EadhcDNA sequence is 

dedvedfiom a (O^NA library constructed fiomalniman tissue Eadh human tissue is classified into 
one offhe following organAissue categories: cardiovascmlar system; connective tissue; digestive 
system; embryonic stmctores; endociitie syst^ exocrine glands; g^taiia, feoiale; gpnitaKa, male; 
> getm cells; hemic and immune system; fiver, rnuscoloskelctal system; nervous system; pancreas; 

20 respiratory system; s^ise organs; skin; stomatognatbic system; unclassifiedAnixed; or urinary tract 
The number of libraries in eac^ category is counted and divided by tibye total number of libraries across 
aDcategories. Similarly, 'hv'nsm tiRsaie \& classified into one of the following disease/candition 
categpries: cancer» cell line* developmentaU inBanunation, neurological, trauma, cardiovdscnlar, pooled, 
and otiier» and the mmiber of libraries in eadi category is counted and dividedby the total number of 

25 libraiies across aQ categpries. The lesnMtig percentages reflect die tissue^ and disease-specific 
exioesdon of cDNA encoding DME. cDNA sequences and cDNA library/tissue information are 
found in Ibe LIFESEQ GOLD database (Tncyte Genomics, Pak Alto CA). 
Ym. Extension of DME Encoding Pdlymideotides 

Pan lengdi potynucleotide sequences were also produced by extension of an appropriate 

30 fiagm^ of the foil length molecule using ofigonucleotide primers designed from this fragment One 
primer was synthesized to initiate 5'extensbn of the known fragment, and die other i»imer was 
synlhesizedtoiiutiate3'eactensionof1faclaK>wnfra^ The initial prian^s were designed using 
OIIGO 4.06 software (National BiosdeaicesX or anotiier qipropriate program, to be about 22 to 30 
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imcleotides in length, to have a GC content of about 5 0% or more, and to anneal to the target 
sequence at tEanpesratnres of about 68 ""C to about 72 "C. Any stretch of nucleotLdes whidi would 
result in hairpin structures and prinaer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 

5 extcnsionwasnecessaryordesired,additionalornestedsetsof primers were designed. 

High fidelity amplification was ohtained by PGR using methods well known in the art PGR 
was performed in 96-wen plates using the FrC-200 thennal cycler (MJ Research, Inc.). The reaction 
mix contained DNA ten^late, 200 nmol of each primer, reaction buffer containing Mg^, 0SnB4)jSO4, 
and 2-mercaptoeflianol, Taq DNA polymerase (Amersham Pharmada Biotech), ELONGASB 

10 enzyme (life Technologies), and Pfu DNA polymerase (Stratagcne), wiflittie following parameters 
for primer pair pa A and PaB: Step 1: 3 min; Step 2: 94'*G, 15 sec; Step 3: fiO'^C, 1 mm; 
Step 4: 68"C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 tiooes; Step 6: e&^'C, 5 min; Step 7: storage 
at 4"C 111 fte aHemadve, the parameters for primer pair T7 and SK+ were as folows: Step 1: 94**C, 
3 min; Step 2: 94°C, 15 sec; Step 3: 57*^C 1 min; Step 4: eS^'C, 2 11^ 

15 repeated 20 tiines; Step 6: 68*C, 5 inin; Step 7: storage at 4*C 

The concentratioiiof DNAin eadi well was deCraninedby dispensing 100 jUlPICOGREEN 
quantttaficm leag^ (0^25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 03 Ml of undihted PCR product into eadi wcB of an opaque ftaorimetB^ 
Acton MA), alkiwing the DNA to bind to the reagent The plate was scanned in a Huoroskanll 

20 0^bsystCTnsOy,Helsin!d,Hnland)tome^^ 

conceaitrationofDNA. A5 Adto lOidaBquotoffheieactionmixti^ 

electrophoresis on a 1 % agarose g&lto d^mnne whidi reactions were successful m extending the 
sequence. 

Ibe ^tended nucleotides were desalted and concoitrated, transfocred to 384-weIl plates, 
25 di^sled wifih CviJI dtol^ virus ^onuclease (Molecular Biology Research, Madison WI), and 
sonicated or ^leated prior to religation mto pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgw sequ»idng, flie digpsted nucleotides wctc separated on low concentration (0.6 to 0.8%) 
agarose gpls, fragments wore excised, and agar digested wilh Agar ACE (Promega). Extended 
clones were relighted usmg T4 ligase (New England Biolabs, Beverly MA) mto pUC 18 vector 
30 (Amersham Pharmacia Biotech), treated witti Pfu DNA potytnerase (Stratagene) to M-in restriction 
site ovohangs, and transfected into competent cells. Transformed cells w^ selected on 
antibiotic-K^ontainmg media, and ii^vidual colonies were picked and cultorcd overnight at 37 m 384- 
weH {dates in LB/2X carb Squid media. 
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The cells wcie lysed, and DNA was atnplificd by PGR using Taq DNA polymerase 
(Ameisham Pharmacia Biotech) and Ffo DNA polymerase (Stratagene) with the following 
parameters: Step 1: 94°C, 3 min; Step 2: 94X, 15 sec; Step 3: 60X, 1 noin; Step 4: 72°C, 2 min; Step 
5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C DNA was 
quantified by PICOOIEEN reagent (Molecular Probes) as described above. Samples wilh low DNA 
recoveries were reampliiied using the same conditions as described above. San^iles ware dihited with 
20% dimelhysulfoxide (1-.2, vAr), and sequenced using DYENAMIC energy transfer sequencing 
primers and the DYENAMIC DIRECT Mt (Amersham Pharmada Biotech) or &e ABI PRISM 
BIGD YE Terminator cycle sequencing ready reaction Idt (Applied Biosyst^ns). 

In like manner^ fall length polynucleotide sequences are verified using the above procedure or 
are used to obtain 5* regulatory sequences using tihe aibove procediue cdong wiifli olgonucleolides 
designed for sudi extension, and an appropriate genomic librazy. 
DL Labeling and Use of Ihdiviflual Hybridization Probes 

Hybridization probes derived &om SEQ ID NO:20-38 are e^loyed to scami cDNAs» 
gEaipmic DNAs, or mRNAs. Aldiough tibe labeling of otigonucleottdes, consisting of about 20 base 
pairs , is spedficaUy descxibed, essential^ ttie same procedure is used m& latg^ nocleotide 
fragments. Oligonucleottdes are design^ usi^gstate^f-dle-aIt software su^ 
software (National Biosciences) and labeled by combinh^ SO pmol of eadh oligofmer^ 250 juQ of 
[y.32p] ad^iDsme triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN» Boston MA). The labeled oligpnucleotides are substantialLy purified using a 
SEFHADBX G-2S supetfiiuB size racclnsion deiitran bead column (Amersham Fhatmada Biotedi). 
An aliqaot conteiniog 10^ counts per minute of die labeled pirobe is used in a t^»calnQcaribrane4>ased 
hybridization analysis of buinan gnomic DNA digested widi one of die following endonucleases: Ase 
I, Bgl n. Eco RI, Pst I, Xba I, or Pvu n (DoPont NEN). 

The DNA from each digest is fractionated on a 0.7 % agarose gel and trans^ied to nylon 
membranes (Nytran Phis, ScUeidier & Sdiuell, Dudiam NH}. Hybridization is carried out ibr 16 
hours at 40 ""C To remove nonspecific signals^ blots are sequ^tiaUy wasbed at room temperature 
ui^er conditions of up to, fr>r ^cample, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridizadon patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

X. Microarrays 

The linkage or synthesis of array elements upon a midoarray can be achieved utilizing 
photolilhogr^hy, piezoe^ctric printing (ink-jet printing. See, e.g,, BaldescbweOer, supra.), mechanical 
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iiucn>5pottit^tecli^ ThesabstratBineac^of dieafbtGrnei^ 

tedmologLes should be utiiform and solid with a non-poious surfoce (Schena (1999), supra). 
Suggested substrates include silicon, silica, glass slides, glass ddps, and sUicon wafers. Alternative^, a 
procedure analo gous to a dot or slot blot may also be used to arrange and link elements to die surface 

5 of a substrate using thermal, DV, c^enucal, or mechanical bonding procedure A typical array may 
be produced using avaUab^ methods and machines weHlaiDwn to those of ordinary skill in the art and 
may contain any appropriate number of elements. (See, e.g., Sdiena, M. et aL (1995) Science 
270:467-470; Shalon, D. et aL (1996) Geaoome Res. 6:639-645; Marshall, A. and J. Hbdgjson (1998) 
Nat BiotechnoL 16:27-31.) 

10 FuB lengdi cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 

comprise the eletoents of die microaiTay. B^agments or oligonaers suitable for bybri^zadon can be 
selected using software wen known mttie art such as lASERGEN^ Ibe 
array elemoAts are bybridized widi polynuckolides in a biologiccd sample. The polymickotides in die 
biological sanq>le are conpigatcd to a fluoiesccnt label or other molecular tag for ease of detection. 

IS After hybridization, nonihybiidized nueleotides from die biolog^bal sample are removed, and a 
fhiorescencescanni^ is used to detect tq/bridization at each array el^ Aginatively, las^ 
desoibtion and mass spectrometry may be used for detection of hybridization. The degree of 
compl^entatity and die relative abundance of eadi polynucleotide vi/bicTx hybridizes to an element on 
the microatray may be assess. InoneemlM>dini^)t,microanaypr^arationandnsag&lsdescdb 

20 in detail below. 

Tissue or Cdl Sample Preparation 

Total KNA is isobted from tissue saoqiles using the gaanidimtmdiiocyanatemediod and 
poly(A)*^BNA is purified udng die oligD-<dT)ce]lnlosex^ Bachpoly(A>*^RNAsaDq»leis 
reverse transcribed using MMLV tevers&-ttanscriptase, O.OS pg/fd oligp-(dT) primer ^Imer), IX first 

25 strand buf^, 0.03 nnits/^1 RNase inhibitor, 500 fiM dATP, 500 nM dGTP, 500 mM dTTP, 40 |iM 
dCrP,40;MdCIP-Cy3(BDS)ordCIT-C^(Amc^^ Thcreveise 
transcri{^on reaction is p^oimed in a 25 ml volume containing 200 ng poly(A)* RNA with 
(IEMBI0(Hiridt5(Ihcyte). Specific control polyf A)^ RNAs ar& syndiesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37° C for 2 hr, each reaction sample (one 

30 with Gy3 and another widi Cy5 labeling) is treated widi 2.5 nd of 0.5M sodium hydroxide and 

incubated for 20 minutes at 85° C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gpl filtration spin columns (CLONTECH Laboratories^ Inc. 
(CLONrECH)» Palo Alto GA) and after conibining, both reaction samples are ethanol predpitated 
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using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanoL The sample is 
then dried to completion using a SpeedVAC (Savant Listnunents hic, Holbrook NY) and lesQspended 
in 14 ^1 5X SSCA)-2% SDS. 
Microairay Preparation 

5 Sequences of the present invention are used to generate array elements. Each array element 

is amplified j&nm bacterial cells containing vectors with cloraed cDNA inserts. PGR amplification uses 
primers complementary to ftie vector sequ^ces flanking the cDNA insert Array elements are 
amplified in thirty cycles of PGR from an initial quantity of 1-2 ng to a final quantity greater than 5 /ig. 
Anq)lified array efements are then purified using SEFHACRYLr400 (Amersham Pharmacia Biotedli). 

10 Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 

slides (Coming) are cleaned by uhrasound in 0. 1% SDS and acetone, widi extensive distDled water 
washes between and after treatments. Glass slides axe etdied in 4% hydrcyjBhioric add (VWR 
Sciraitific Products Corpotation (VWR), West Chests PA), washed ^tensively m distilled wat^, and 
coated witih 0.05% aminopropylsilane (Sigma) in 95% eflianoL Coated slides are cured in a 110*C 

15 oven. 

Array elements am applied to die coated glass substrate using a procedore des(^ 
Patent No. 5,807,522, incorporated hradnby reference. 1 ftl of the array etenent DNA, at aa.av^^rage 
ccHicentration of 100 ng/fd, is loaded into the open capiDaiy printmg element by a high-speed robotic 
apparatus. Theqiparatus&endqM>sitsabont5nlof antiyel^iiea^ 

20 I^croarrays are UV-ccossfinked nsmg a STRATAUNEER UV-crosslmk^ (Stnitag^). 

Microarra]?s are washed at loom t^nperature once in 0.2% SDS ai^dusetm^ . 
Non-spedfic binding sites are blocked by jncubation of microairays in 0^% casein in phosphate 
buiBfeted saline (PBS) (Tiopix, Inc., Bedford MA) for 30 minutes at 60^C followed by washes in 0.2% 
SDS and distilled water as before. 

25 Hybridization 

Hybridizatioii reactions oontmn 9 ;d of sample mixture consisting of 0.2 fig each of Cy3 and 
Cs5 labeled cDNA synlhcsis products in 5X SSC, 0.2% SDS hybridization bnfier. The san^Ie 
mixture is heated to 65*" C for 5 minutes and is aliquoted onto the micro array surface and covered with 
an 1.8 cm^ coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
30 larger than a microscppe slide. The chanaber is kept at 100% humidity intern^y by the addition of 140 
/il of 5XSSC in a comer of the chamber. The chamb^ containing the arrays is incubated for about 
6-5honrsat60**C. The arrays are washed for 10 min at 45*C in a first wash buffo (IX SSC, 0.1% 
SDS), three times for 10 minutes each at 45**C in a second wash buffer (O.IX SSC), and dried. 
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Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped wifli an 
hmova 70 mixed gas 10 W laser (Coh^ent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 mn for excitation of Cy5. The excitation laser light is 

5 focused on the array nsing a 20X nodcroscope objective (Mkon, Inc., Melville NY). The slide 
containing the array is placed on a comput^'K:ontroned X-Y stage on fhc microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 can array used m the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluoiophores sequentially. 

10 Emitted hght is split, based on wavelengfli, into two photomuMplier tube detectors (IMT R1477, 

Hamamatsu Photonics Systems, Biidgpwater NJ) corresponding to the two fluorophores. Apptopriate 
filters positioned between fteanay and file photonmltipHer tubes aie used t^ The 
enaissionmaxiimofthefhiorophoresusedare565nmforCy3and650^ EacJiarrayis 
typically scanned twice, one scan p^ flnoiotAore u»ng tiie appiopciate filters at ftie lasM source, 

15 aWiough the apparatus is cspMo of recording flie spectra frombotii fluorophores simultaneous^. 

The sraisitivity of Ab scans is ^kally calihratediisu^ flie sigiml intensity g^n£3:ated by a 
cDNA control species added to tfiesanqple mixture at a known i^ncc^^ A specafic location on 
^ array contains a cotD|i]emcaitaiy DNA sequence, aDo wing the intensity of flie signal at that location 
to be correlated wifti a w«^ ratio of h3^>ridi2ingspede5 of ^^^^ When two samples from 

20 diffCTcnt sources (e.g.,reparcsaitiijg test and control celkX 

are hybridized to a smgle array for tibe purpose of identifying genes tiiat are differ^itially eaqpressed, 
the cam>ratian is done by labdmg sanqiles of flie calibrating cDNA wifli flie two fbiorophDres and 
addii]\g identical amounts of eadbi to flie hybridization iiiixtnie. 

The output of flie photoniulti^lier tube is digitized using a 124>it RTI-835H analog-to-dig^tal 

25 (A/D) convmion board (Analog Devices, Inc., Norwood MA) installed in an ffiM-corapatible PC 
counter. The digitized data are dijq>kyed as an image where the signal intensi^ 
linear 20-c»tor transformalion to a pseudocolor scate rangmg from blue (lo^ 
signal). The data is also analyzed quantitative^. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission 

30 spectra) between the fluorophores using eat^ fhiorophore's emission spectrum. 

A grid is superin:4>osed over the fluorescence signal image sn ch fliat the signal from eac^ spot 
is cCTitMed in each element of die grid. The ftoorescence signal witiuneadi element is flien integrated 
to obtain a numerical value corresponding to flie aviarageintE^ The software used 
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for signal analysis is tbe GEMT(X)LS gene e^qpress^ 
XI. Complementary Polynucleotides 

Sequences conqilementary to the DME-encoding sequences, or any parts thereof, are used to 
detect, decrease, or inhibit expression of natoraDy occorring DME. Although use of oligonucleotides 
5 comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with 
smaDer or with larger sequence fragments. Appropriate oligonucleotides are desigoed using OLIGO 
4.06 software (National Biosdences) and the coding sequence of DME. To inhibit transcmpdon, a 
conq)lementary oligonucleotide is designed from flje most unique 5 'seqpience and used to prevent 
promoter binding to fiie coding sequence. To inhibit translation, a comptonraitaiy oligonucleotide is 
10 designed to pjrventjdbosonialbiDdiiig to Ac DMB-ca^^ 
xn. Expressicmof DME 

Expression and purification of DME is achieved usmg bact^al or vims-based expression 
systems. I^r expression of DME in bact»ia,d>NA is subcloned into an 

containing an antibiotic resistance g^ and an indadbfe promoter that directs hi^ levels of cDNA 

15 transcription. Exanq)]es of sd(^ promoters indode, but are not ttnnted to, ti^ 

promoter and tibe TS or T7 baclmopiiage piomot^ in conjunction witili ^ lac operator regulatory 
element Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(PB3). 
Amihiotic resistant bacteria ejqjfess DME upon induction with isopropylbeta-D-ftiogalactopyranoside 
(DPTG). EsqpresdonofDMBineukaryotbcelkisaduevedbyinfecti^ 

20 lines wiih reconibinant Autographica caHfomica nuclear pol^iedrosis vims (AcMNPV), commonly 
known as baculovirus. The nonessential pol^bediiug^ of bacnlo^iros is repla^ 
encoding DME by dfherbomologous recombination or bacterialnniodiatedtranspositfoninvol^^ 
transfer plasnoidintesmiediates. Viral infectivi^ is maintainind and the stronig polyhedrin promoter 
drives hjghlevdts of cDNAtranscr^tk>n. l^ecombinant haculovirus is used to infect Spodoptera 

25 fiugiperda (Sf^) insect ceBs in most cases, or human hepatocytes, in some cases. Infection of the 

latter leqniresadditkjnalgiMjeticniodifications to baculoviras. (See Engelhard, E.K- ct aL (1994) Proa 
NatL Acad. ScL USA 913224-3227; Sandig, V. et al. (1996) Ham, Gene Ther. 7:1937-1945.) 

In most expression systems, DME is synflhuesized as a fusion protein with, e.g., gbitathione S- 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-HQs, permitting rapid, smgle-step, 

30 afBnity-based purification of recombinant fosion protein from crude cell lysates. GST, a 26«kilodalton 
enzyme from Schistosoma iaponicom , enables the purification of foaon proteins on immobilized 
glntathione undex conditions tot maintain protein activily and antigeniuty (Ameisham Pharmacia 
Biotedi). FoUowingpnrification, the GST inoie^ can be proteolytica% cleaved from DK^ 
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spedficaDy cngmeered sites. FLAG, an S-amino add peptide, enables inmiunoaffinity purification 
using conuiKTdally available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 
IBs, a stretdi of six consecutive histidine residues, enables purification on metal-dielate resins 
(QIAGEN). Methods for protein eiqpiession and purification axe discussed in Ausubel (1995, supra , 

5 clLlOandl6), Purified DMEol)tained by liieseniethods can be nsed directly in Ae assays shown in 
Exan4)lesXVI,XVII,andXVID[, where applicable. 
:KIII. Functional Assays 

DMB fiinction is assessed by caressing tibie sequences encoding DME at physiologically 
elevated levels in mainmalian ceU culture systrans. cDNA is suhcloned into a m am m alian expression 

10 vecU)r containing a strong proniotorfiiat drives high levck Vectors of dioice 

inchide PCMV SPORT (Life Tedmologies) and PCR3.1 (Invitiogeai, Clarlsbad CA), botii of which 
contain fliecytoinegsJovirusprDmoteT. 5-10 /ig of recombinant vector are ttansieoftytransfected into 
a human cell Bne, for eixample^ an endothelial or hmiatopoietic cell line, using either liposome 
fonmdations or electxoporation. 1-2 Mg of an additional plasmid containing seqn^ices encoding a 

15 maAerprotdnareeo-transfected Bjpresswn of a noaffker protean provides a means Id disti^ 
transfected celk fcrai nontransfectjed celb and is a teHa^ 

lecombmant vector. Markw proteins of.dK>ice include, e.g,. Green Ehiorescent lutein (GFP 
Ctontedi),CI)64, or a (3)64-(a^ fusion protein. How cytanicfay(FC3M0, an aatoniated, laser optics- 
based tedmiqae, is nsed to idwilifytransfected cells e^qf^ • 

20 flieapoptotic state ofthecelb and other ceDularpropaties. FCM detects and qoantifiesihe uptake of 
fborescentmokcales that diagnose events preceding or coincideiitwi^c^^ Theseevraits 
include change in nuclear DNA content as measured by staining of DNA wifli propidinm iodide; 
changes m odDi size and granularity as measured by forward light scatty and 90 degree side li^t 
scatty, down-r^olation of DNA synthesis as measured by decrease m bromodeoxyuriduie uptake; 

25 alterations in e^^qxres^n of ceD snrfece and intracelhilar poxiteins as measured by leactivily with 
specific antibodies; and alterations m pla^a membrane composition as measured by the binding of 
fluoicsccin-^njugatedAnncxinVprotdbDL to the cell surface. Mediods in fk)W cytomctiy arc 
discussed in Qrmerod, M.G. (1994) Plow Cvtometrv . Oxford, New York NY. 

The influ^ice of DME on gene expression can be assessed using highly piirified populations 

30 of cells tosfected wifli sequences encoding DME and dther CD64 or CD64-GFP. CD64 and 

(3)64-GFP are expressed on die surface of transfected cells and biod to conserved regions of human 
immunoglobulin G (IgG). Transfiscted ceBs are effidendy separated fit>m nontransfected cells using 
magnetic beads coated with eilhea: human IgG or antibody against 01)64 (DYNAL, Lake Success 
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NY). mRNA can be purified ftomtlie cells using methods wen 

Expression of mRN A encoding DM£ and other g^oes of interest can he analyzed by nortbem amdysis 
or microarray techniques. 

XIV. Production of DME Specific AnUbodies 

5 DME substantial]^ purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 

Harrington, M.G. (1990) Methods Enzyniol 182:488-495), or other purification techniques, is used to 
inuxiDiiize rabbits and to produce antibodies using standard protocols. 

AltemativBly, the DME atnino acid sequence is ana^^zed using LASERGENE software 
(DNASTAR) to determine regions of hi^ immunogenicity, and a corresponding oligopeptide is 

10 synthesized and used to raise antibodies by ineanslmown to those of sldUi^ Methodsfor 
selection of appropriate epitopes, such as those near the C-teiminus or in hydrofdiilic regions are well 
described in the art (See, e.g., Ausubel, 1995» supra > ch 11.) 

Topically, oligopeptides of about 15 residues in length are syndiesized using an ABI 43 lA 
peptide synfhcsizor (A]^fiedBiosystenis)iishigFMOCchcmistEy and coupled to KI^ 

15 AUridi* St Louis MO) by leacdon with N-m«1ftimidobenzDyi-N4iydrDxysuccinimid& estei* (MBS) to 
increase immnnogenidty. (See, e.g., Ausubel, 1995, supra. ) Rabbits are immunized with tihe 
oHgppeptide-KLH compl&x in complete Freund's adjuvant Resulting antisera are tested for 
antipeptide and anti-DME activity by, for exanq>le, binding die peptide or DME to a substrate^ bloddng 
with 1% BSA, reacting with rabbit antisera, washmg, and reacting with radio-iodinated goat and-rabbit 

20 feG. 

XV. Purificaiionof Nattu^yOccumngDMEUa^ 

Natnraify occordng or recombinant DME is sabstantiaDy puriiBed by imnmnoafBnity 
cfaromatogFii[%u$faig antibodies ^ledfic for D^ An immimoafBnity column is constructed by 
coval^y coupling and-DMB antibody to an activated duromatographic resin, such as CNBr-activated 

25 SEf%[AROSE(Ain£fsfaamFluaiii^ 

according to 13^ manafiactur^^ instractions. 

Media containtng DME are passed ovor the immunoaffinity column, and the column is washed 
under conditions diat allow die prefer^tial absorbance of DME (e^g., hig^ ionic streaigth buffers in the 
presence of detergent). The column is eluted under conditions that dismpt antibody/DME binding 

30 (e.g., a buffer of pH 2 to pH 3, or a conceaiti^on of a chaqtrope^ such as urea or thipcyanate 
ion), and DME is collected. 

XVL Identfflcatlan of Molecules mich Interact with DME 
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DME, or biologicaify active fragments flieieof, are labeled with Boltoi]hHimter reagent 
(See, e.g.. Bokon A.E. and WM. Hunter (1973) Biochem. J. 133-529-539.) Candidate molecules 
previously arrayed in flie weDs of a nniki-well plate are incubated with the labeled DME, washed, and 
any weDs with labeled DME complex are assayed. Data obtained using differrait concenlralions of 
5 DME are used to calculate values for the number, afBnity, and association of DME with the candidate 
molecules. 

Alternatively, molecules inter^:tiDg with DME axe anzdyzed using die yeast two-hybrid system 
as described in HeMs, S, and O. Song (1989) Nature 340245-246, or using commercially available kits 
based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

10 DME loay also be used in fliePATHCAIIJNG process (QiraG^^ 

which employs the yeast two-hybrid system in a higji-througbput mann^ to detotmine all interactions 
between the proteins caticodod by two large libraries of g^ies (Nandabalan, K. et aL (2000) U.S. 
Pateant No. 6,057,101). 
XVn. Demonstration of DME Acttvity 

15 CytDcau»in©P450 activity of DME is ineasuredusfaigflie4-hyd^ 

converted to 4-aminopheDol by flie oozyme, and has an absorption ma^mum at 63 0 nm (Gibson and . 
Skett, snpraY This assay is a convenient measure, but und^cestimates the total hydroxylati0n, which 
also occurs at the 2- and 3~posidons. Assays are performed at 37**C and contain an ailiqaot of die 
^izyme and a suitable amoimt of aniline (approximately 2 mM) in reaction buffer. For this reaction, 

20 the buffer iniist contain NADEB or an NADPH-g^CTalingcof^^ Qnefbimnlationfor&is 
reactionbufEbi: inchides 85 nM Tris pH 7.4, 15 mM MgCt 50 mM nicotinamide, 40 mg trisodimn 
Isodtrate, aiid 2 units isodlrto ddhcydn>gcnase> wit 

stockjust prior to assay. Reactions are earned out in an optical cuvette» and 1^ 

is measured. The rate of increase in absotbance is proportional to the enzyme activity in die assay. A 

25 standard curve can be constructed usir^ known concentratioiis of 4-aminDi^ 

la,25-^ydroayvitanMnD 24^ydroxylase activity of DME is det^mined by monitoring die 
convrasian of %t-labeled lo25-dihydroxyvitamni D (la^(OH)jD) to 24;25-dihydroxyvitamm D 
(24;25(OH)2D) in transgenic rats ejipressing DME. 1 /ig of la^5(OHyD dissolved in ethanol (or 
ethanol alone as a control) is administered intravenously to approximately 6-week-old male transgenic 

30 rats es^xressing DME or otherwise identical cmtrol rats expressmg either a defective variant of DME 
orn0te2)»:essmgDME. The rats are killed by decapitation after 8 hrs, and the Mdneys are rapi 
r^noved^ rinsed, and homogenized in 9 volumes of ice-cold buffer (15 mM Tris-acetate (pH 7.4), 0. 19 
M sucrose, 2 inM magnesium acetate, and 5 rnMsodhim succinate). A portion (e.g,, 3 ml) of each 
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honH)g^ate is tttcn incubated wifli 0.25 nM l(a5(0H)j[l-W), with a spedJBc activity of 
proximately 3.5 GBq/mmol, for 15 nun at 37 °C nnder oxygen with constant shaking. Total lipids 
are extracted as described (Bli^ RG. and WJ. Dyer (1959) Can. J. BiochenL Physiol. 37: 911-917) 
and the chloroform phase is anatyzed by HPLC using a jPINEPAK SJL column (JASCO, Tokyo, 

5 Japan) wifli a n-hexane/chloroform/tneliianol (10:2.5:1.5) solvent system at a flow rate of 1 nolAxm In 
flic aitemative, flie chloroform phase is analyzed by reverse phase HPLC using a J SPHERE 
ODS-AM column (YMC Co. Ltd., Kyoto, Japan) witih an acetonitritebiiflBMr system (40 to 100%, m 
water, in 30 min) at a flow rate of 1 ml/miiL The ehiates are coDected m fractions of 30 seconds (or 
less) and the amonnt of ^present in each fraction is measo^ By 

10 conqjaiing the chromatogtams of control samples (Le., samples comprising lo^S-dihydroxyvitamin D 
or24;25-dihydroxyvitainmD (24,25(OH)jD), witii the chiomatog 

relative mobilities of the sobslrate (laj25(OH)j[l-^) and product (24,25(OH)j[l-=^) are 
deteacnuned and condated wKhtiie fractions coDectedL The amount of 24,25(OH)2[l-'BP prodaced 
in control rats is subtracted from fliat of transgoiiciatscjq^^ IhcdifBaresncein&e 

15 productianaf24,25(OI©i[l-^H]DintiiBtransgemcandcoito 

of 25-hydiDlase activity of DME present in tiie sample. Ccmfiinnation of tiie identity of tiiesabstrate 
and prodact(s) is confirmed by means of mass spectroscopy (Miyamoto, Y. et al. (1997) J. BioL 
Chfim. 272:14115-14119). 

Flavin-containing monooxygpnase actiioty of DM£ is measured by chromatographic analyds 

20 of metabolic products. For example. King, B. J, aL (1999; Drag Metab. Dis. 27:1099-1103) 
incubated FMO m 0. 1 M sodium jdiosphate bnffer (pH 7.4 or 8.3) and 1 noM NADPH at 37 **C, 
stopped tiie reaction with an organic solvi^ and determined product formation by HFLC 
. Allranative^y,aclivily is ineasnted by monilDring oxygen uptake n^ 
exam|>lB» Ziegler, D. M. and Ptoulsrai, L. L. (1978; Methods EnzymoL 52:142-151) mcubated tiie 

25 Kizyme at 37 in an NADPH-generating cofactor system (similar to the one described above) 
containmg ftx> substrate methimazole. The rate of oxygen uptake is proportional to enzyme activity. 

UDP ghicoranytoransferase activity of DME is measured usmg a colorimetric dete^cmination of 
free amine groups ((3ibson and Skett, supra\ An amine-containing substrate, sudi as 2-aminophenol, is 
mcubated at 37 °C witii an aliquot of tiie enzyme in a reaction buflfi^ contaming the necessary 

30 cofectors (40 mM Tris pH 8.0, 7.5 mM MgClj, 0.025% Triton X-100, 1 mM ascorbic acid, 0.75 mM 
UDF'ghicuronic acid). Aft^ suflfident time, tiie reaction is stopped by addition of ice^ld 20% 
triddoro^^tic acid in 0.1 M phosphate buffer pH 2.7, incubated on ice, and centrifaged to clarify die 
supernatant Any xmreacted 2*aminophBnol is destroyed in this step. Suffident freshly-prepared 
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sodhim nitrite is then added; tim step allows fotmation of the diazonium salt of tiae ghicaronidated 
product Excess nitrite is itanoved by addition of sufficient ammonium sulfamate, and the diazonium 
salt is reacted with an aromatic amine (for exampfe, N-naphflbLylethylene diamine) to produce a colored 
azo compound wMdi can be assayed spectrophotometrically (at 540 nm for the example). A standard 

5 curve can be constructed using known concentrations of aniline, which will form a chromophoic with 
sinnlar jwropcrties to 2-annnophcnol ghicnronide. 

Sulfotransferase activity of DME is measured using Ihe incoiporation of from p^S JPAPS 
into a model substrate sucb as phenol (Folds, A. and Meek, J. L. (1973) Biodrim. Biophys. Acta 
327:365-374). An aliquot of enzjane is incubated at 37 "C wifli 1 niL of 10 mM phosphate buffer pH 

10 6.4, 50 ^iM phenol, 0.44.0 \jM pSJPAPS. After suffid^ time for 5-20% of the radiolabel to be 
transferted to the substrate, 0.2 mL of 0. 1 M barium acetate is added to precq>itate protein and 
phosphate buffca:. Then 0.2 mL of 0.1 M Ba(QH)2 is added, followed by 0.2 iiiL ZnSO+. The 
supernatarit is cleared by cesttrifogation, wMch retooves piotdns a^ 

Radioactivity in flie supernatant is measured by sdntillatioa The enzyme activity is detemuned fiam 

15 . ' the rmnob^ of moles of radioactivity in the reaction product 

GQataHuone S-transf^ase activi^ of DME is measured using a model substrate, such as 2»4- 
dinitro-l-dUorobenzene, wMdh reacts with ghitathione to form a product, 2»4-dhnttophenyl-gIatad]ione, 
that has an absorbance maximum at 340 nm. It is important to notediat GSTs have difTraing substrate 
specificities, and die jmodel substrate should be selected based on &e substrate preferences of the 

20 GST of int^est Assays are performed at ambient t^Dap^:ature atid contain an aliquot of the enzyras^ 
• in a sniitable reaction buffer (for exati^le, IniMglntafliion^ 

potassrom phosphate buflfer pH 6.5). Reactions arc carried out in an optical cavette, and Ihc 
absorbance at 340 nm is ineasured The rate of increase in absofbanoe is pr^ 
activity in the assay. 

25 N-acyltransferase activi^ of DME is rneasured using radiolabeled an^ 

measuring radiolabel incorporation into conjugated products. !&izyme is incubated in a reaction bnfi^ 
containing an unlabeled acyl~CoA amapound and radiolabeled amino acdd» and the radiolabeled ac^lr 
conjugates are separated fiom the nnreacted amino acid by extraction mto n-bntanol or oth^ 
appropriate organic solvent For example^ Johnson, M. R. et aL (1990; J. BioL Chem. 266:10227- 

30 10233) measured bile add-CoA:amino add N-acjtoansforase activity by incubating the enzyDDu& with 
chol^-CoA and ^-glycine or ^-taurine, separating the tritiatcd diolate conjugate by extraction into 
n-butanol, and measuring the radioactivity in the extracted product by sdntillatioiL Alternatively, N- 
acyhransferase activity h measured using the spectrophotometric detertoination of reduced CoA 
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(CoASH) described below. 

N-acetytaansferase activity of DME is measured using the transfer of radiolabd fiom 

pX:jacetyH:oA to a substrate molecule (for example, sec DcgucH, T. (1975) J. NeoiodiEm. 

24:1083-5). Mtematively, a newer spectiophotometiic assay based on DTNB (5^*-difliio-bis(2- 
5 mtrobenzoicacid;Enmau'sreageut)ieactioBwi&CoASHmaybeused.B.« 

is fonned during N-acetyltransfcrase catalyzed transfer of anacetylgronpto a sobsttate. CoASH is 

detectedusingtheabsorbance bf DTNB conjugate at412nm(De Angelis. J. etaL (1997) J-BioL 

Chem. 273:3045-3050). Enzyme activity is proportional to the rate of radioactivilyinooipoiation into 

substrate, orthcrateofabsorbanceincreaseintiiespcctropholDmBtricassay. 
10 ProteinarghunemefliyltransferaseaclivityofDMEismeasuiedata^ 

oftime. S-adenosyl-Mmethyl-'Hlmeariomnc(PHlAdoMet;spo^ 

IifcScienceProdacts)isusedastheiuefliyWoiiarsubstrate. Usdamcfliyl-acoepting substrates 

inctade gtotathione S-transferasefibrinaiingbwn^ 
heterogeneous nucfeainTHHmdeoprotBia(lmR^ 
15 ad^iosinediaUdiyd^-trcatcdoells. Methytetionieac^ 

buffer. TJ«pioducteoffl«ieactk»sare«solvBdbySDS.FAGBai.dv^ 
Ttel«BS«K»ofni-M^ii>eaqa-Afflorsri^ 

activity of DME CTang. J. et aL (2000) J. BioL Chem. 275:7723-7730 and Tang, J. et (2^^ 

ChMn.275:19866-19876). 
20 Aldo/ketoreductaseactivityofDMBismcasuredusingthodecreaseiaabsorba^ 

asNADPHis consumed. Astandard«actionmbDtm»js 135 mM8odium0^^^ 

73dqKa>dingonaByme).0.2mMNADPH.03Mliflnomsal^ 

appropriate kvdofsubstnde. TteKactionis incubated at 3^^^ 

contiDUouslywifliaspecttophDton^ter. Enzyme activity is calculated as molNADFB consumed /Mg 
25 of QDzynie. 

Afcohol dehydrogpnase activity of DME is measured using increase in 
mnasNAD-isiedocedtoNADa A stundaidieaction mixture is 50 mM sodium phosphate, pH 7-5, 
and 0.25 mM EDTA. Hk reaction is incubated at 25 °C and monitoied using a spectrophotometer. 
Enzyme activity is cakdalBd as mol NADH ptodoced / M« of enzyme. 
30 DMB activity is detenrined using 4-nieaiylairibenifcryl acetate as a substrate. The 

enzymatic leactionisimtiatedby adding ap^ 

reaction bu&r (90 KHJPO4. 40 nM BO, pH 7.3) with 0.5 mM 4-me%tatxibeffifeiyl acetate 
at t:. The prodoctiDnof 4-niell^ytanibdlil^ 
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12.2 iriM-^ cur*) for 1.5 rsasL Specific activity is ejqpressed as inicronQDles of product formed per 
nmute per imDigram of ptotdn and coixesponds to the activity of DMB in the sample (^vgenia, V. ei 
al (1997) L Biol Cheia 272:14769-14775). 

la the alternative, the cocaine benzoj4 ester hydrolase activity of DMB is measured by 

5 iDciibatiDg ^iproximately 0. 1 id of enzyme 3.3 nM cocaine ia reaction haSkr (50 mM NaH2P04, 
pH7.4) with 1 inMbfgiy^fnrirtfpe, 1 itMEDTA, and 1 mM ditbiQtbreitol at 37 °C1 Hie reaction is 
incubated for 1 bin a total volume of 0.4 ml thm teroiinated with an equal vohme of 5% 
trichloroacetic acid. 0.1 mlof thBintOTialstandani3,4-dimelbyI^^ 
Precq>itated protein is separated by centrifugation at 12,000 x g for 10 min. The supernatant is 

10 transfored to a clean tube ar^ extracted twice with 0.4 ml of methylene chloride. The two extracts 
are combined and dried undo: a stream of nitrogen The residue is resuspended in 14% acetonitrile, 
250 mM KH2PO4, pH 4.0, with 8 ftl of diethylamine per 100 ml and ) and injected onto a C18 
rewrae-phaseHPlX:colnnmn for separation. The column ehiate was monitoied at 235 ma DMB 
activity is quantified by coiq»aring peak area ratios of the mai^t^ to the intanal standard- A 

15 standard curve was gm^ated with benzoic add standards prepared in a trichloroacetic add-treated 
protein matrix (Bvgenia, V. et aL (1997) J. BioL Chem. 272:14769-14775). 

In anotha: alternative, DMB carbojiyl esterase activity agamst the water-solubte substrate 
pararDitrophen^ butyric add is detained b^ 
drilled in the art In this ptocedi^re, the DME-contai^ 

20 (pH 7.4 or 8.0) or sodium acetate (pH 5.0) in the presmce of 6 hM taurocholate. The assay is 
initiated by adding a fieshty prq>aied para-nitrophenyl butyric add solution (100 ixgfnA in sodium 
acetate, pH 5.0). Carboxyl esterase activity was then monitored and compared with control 
autohydrolysis of the substrate using an spectrophotometer set at 405 ran (Wan, L. et aL (2000) J. 
Biol Caiem. 275:10041-10046). 

25 Bfeparan sulfate 6-sulfotransferase activity of DMB is measuied in vitro by incubating a 

sample containing DMB along with 2.5 ^mol imidazob HQ (pH 6.8), 3.75 ^g of protamine chloride, 25 
mnol (as hexosamine) of completely desutfated and N-resulfated heparin, and 50 pmol (about 5 x 10^ 
cpm) of pS] adenosine 3 '-{^losphate 5'i>hosphosulfete (PAPS) in a final reaction votame of 50 /il at 
37 ''CforZOmin. The reaction is stopped by immersing the reaction tubes in a boiling water ^ 

30 Imin. 0.1/mK)l(asghicuroiricadd)of chondroitinsoffiateAisaddedtot^ 

carriCT. ^S-labcled polysaccharides are piedpitatBd wifli 3 volumes of cold etiiaool containing 13% 
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potassium acetate and separated completely from iinincoiporated f ^SJPAFS and its degradation 
products by g^l chromatography usiug desaldng columns. One unit of enzyme activity is deiined as 
the amount required to transfer 1 pmol of sulfateAnin. as deteimmed by the amount of p^S JPAPS 
incorporated into the precipitated polysacdbarides (Habnchi, H. et al (1995) J. Biol Chem. 
5 270:4172-4179). 

hi the alternative, heparan sulfate 6-salfbtransferasc activity of DME is measured by 
extraction and tenaturation of enzyme ficom gpls following separation by sodium dodecyl sulfate 
polyacrylamide gel ekxrtrophoresis (SDS-PAGE). FoHowiDg separation, the gel is washed with 0.05 
M Tris-Hd pH 8.0, cut into 3-5 nam segmi^ts and subjected to agitation at 4 ""C wifli 100 ^1 of 0.05 
10 M Tiis-HCl, pH 8.0 containing 0.15 M NaQ for 48 h. The elated enz^c is collected by centnhigation 
and assayed for fbet snlfotransferase activity as above (Habuc^ H.et al (1995) J. Biol Oiem. 
270:4172-4179). 

In anoflier altemative, DME solfotransfbrase activity is determined by measoiing transf ^ 
of (^S]saUiate from [^SIPAFS to an immobilized pqptide. In one example, the peptide 

15 (QATBYBYIi^YDHJPBC) r^piesaits the N-tmoinal 15 residues of the mature P-selectin 

gl^coptotdnlig^nd>l polypeptide to whichis added C-tarminai cystdne residue. The peptide spans 
Oireepoteoitial tyrosine sulfotioXL sites. Thepeptideislinked via the cysteine residue to 
iodoacetamide-actiyatBd tesin at a density of 1.5-3.0 /unol peptideAnI of lesin. Ihe ^izyme assay is 
perfomoed by combining 10 ^1 of peptide-derivitized beads wifli 2-20 fd of DMErcontainiiig sample in 

20 40 mM Hpes (pH 6.8), 0.3 M NaO. 20 mM MnC^ 50 mM NaF, 1% Triton X-IOO. and 1 mM 
5'-AMP in a final volume of 130 ill The assay is initiated by addition of 0.5 iiG of pS]PAPS (1.7 
/iM;ia=2 37GBq). Afkxar30minat37X^1he]£actionbeadsarewashod with6Mgu^ 
65 and die radioactivity incorporated into the beads is detmnined by Hquid sdntillation counting. 
TransfiBT of [^Sjsulfate to the bead-associated p^tide is measured to detearmine ^ DM£ activity in 

25 thesampb. One unit of activity is defitied as 1 pmol of product formed pexiDin(0^ 
(1998) Biochemistry 95:2896-2901). 

In anotber altmiative, DME suUbtraosferase assays are paibrmed using p^SJPAPS as the 
sulfate donor m a final vohnBB of 30|il, contams 50 mM Hepes-NaOH (pH 7.0), 250 mM snoDse, 
1 mM dithiothreitol, 14 ^tM[^^S]PAPS (15 CiAnmol), and dopamine (25 fM), p-mtrophenol (5 

30 ^M), or other candidate substrates. Assay reactions ate started by tlie addition of a purified D 
enzyoie preparation or a saapIecoiUaining DMB activity, allowed to proceed for 15 mtnat 37 °C, 
andtenoinatedbyheatiogat 100*Cfor3inia Ihe precipitates formed ate cleared by 
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ceatrifagatiaiL Thesupematantsareflmsifl^^^ 

flim-layercliiOTiatogr^hyorat^ Appropriate 
standards are run in parallel with the supematants to allow the identificatjon of the ^S-snlfeted 
products and detmmne the enzyme specificity of the DME-contaimng samples based on relative 
5 lelates of inigtation of reaction products (Scikaldbara, Y. et al (1998) J. BioL Chem. 
273:6242-6247). 

S qoaleaie epoxidase activity of DMB is assayed in a mixture conqdsing purified DMB (or a 
crude nuxture comprisii^ DMB), 20 mM Tris-HQ (pH 7.5), 0.01 niM FAD, 0.2 unit of 
NADm-cytochrome C (P-450) reductase, 0.01 mM p*C|sqaalene (dispersed with the aid of 20 nl of 
10 Tween80),and0.2%TritonX-100. I mMNADPH is added to initiate tiie reaction foDowed by 
incubation at 37 *C for 30 min The n0nsaponiBAlelq>ids are analyzed by silica gel 
with cfliyl acetate/benzene (0.5:995, v/v). The reaction products are coii^>arcd to those from a 
reaction mature withont DME. The presence of 23(>S)-oxidosqnateiie is confirmed using appropriate 
Iqjid standards (Sakakibara, J. et aL (1995) 270:17-20). 
15 Epoxide hydrolase acdvily of DMB is detratmined by f3fc^ 

cbromatographic (GC) analysis of cAjGical extracts orby folbwing substrate depletion and diol 
production by GC analysis of reaction mixtures quenched in acetone. A sample containing DME or an 
qwxide hydrolase control sample is incubated in 10 niM Tris-HO QpH 8.0), 1 mM 
ethylenediaminetetraacetate (EDTA), and 5 mM epoxide substrate (e.g., ^jdene oxide, styrene 
20 oxide* propylene oxide, isoprene monoxide, epicMorohydrin, epibromohydrin, epiflaorohydiin, glycidol, 
1^-epoxybutane, l,2-epo3ghBxai», or 1,2-epoxyoctanea). A portion of the sample is withdrawn from 
the reaction mixture at various time points, and added to 1 ml of ice-cold acetone cont 
standard for GC analysis (e.g., l-nonanol). Protein and salts are removed by centriiugation (15 mm, 
4000 X g) and Ihe extract is analyzed by GC using a 0.2 mm x 25-m CP-Wax57-CB colunm 
25 (CHROMPACK,MddeIbxu'g,ThcNethcriaiids)andaflame-ionizati^^ Tbe identification of 

GC products is perfomied usmg appropriate standards and controls well known to diose skilled mihe 
art 1 IMtofDME activity is defined as die amount of^izyme that catalyzes die produ 
jmiol of diolAnm (Rink, R. et aL (1997) J. BioL Chena. 272:14650-14657). 

Aminotransferase activity of DME is assayed by incubating samples oontamiiig DMB for 1 
30 iKiur at 3 7 'C in the presence of 1 mM L-kynurenme and 1 mM 2-oxo^t2ira£B in a final volume of 
200 |d of 150 mMTris acetate buffer (pH 8.0) contMnmg 70 /iMPLP- ThefDrmationofkynurenic 
add is quantified by HPIX: with spectrophotometric detection at 330 nm using tOhe appropriate 
standards and controls weUbiown to diosedilled in die art Inthealtemative, 
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L-3-hydro:xyl;yatueniiie is used as substrate and Hie pxodnction of xantburenic add is determined by 
HPIX^analysisof the products wifliUV detection at 340 miL Tbepioductionof kynuieiuc acid 
xanthurenic add, respectively, is indicative of anunotransferase adtily (Buchli, R. et aL (1995) J. Biol. 
C3)em. 270:29330-29335). 
5 In another alternative, aminotransferase activity of DME is measured by determining the 

activity of purified DMB or crude samples containing DME toward various amino and oxo add 
substrates und^ single toxnovN conditions t>y tnodtonDg ^ du^ 
spectnmi of flie enzyme-bound cofector, FLP. The reactions av^ 

4-inelhyhnorpholine (pH7 J) containing 9 fiM purified DME or DME containing samples and 

10 substrate to be tested (amino and oxo add substrates). The half-ieacdon from amino acid to oxo add 
is followed by measuring die decrease in absorbance at 360 mn and fhe increase in absoifoance at 330 
' nm due to die conversion of enz^e^boimdPLP to PMP. lYie spedfidty and relativB activity of 
DME is d^ermined by the activity of the enzyme preparation sigainst spedfic substrates (Vacca, ILA. 
et ^. (1997) J. BioL Cbcm. 272:21932-21937). 

15 Superoxide d&smutase activity of DME is assayed from cell pellets, culture supematants, or 

purified protein preparations. SailqplBS or lysates are resolved by electrophoresis 
non-denaturing polyacr^anude gels. Th&gplsareiiicubatedfbr30minin2.5n]Mnitroblue* 
tetrazofinm, foOowed by incubation for 20 niunin 30 mM potassium phosphate, 30 mM TEMED, and 
30 /(M riboflavin (pH 7.8). Superoxide distmtaseadivrty is visualized as wlute bands agam^ 

20 bqrlrgff'^T'd^ finlViwilig iTIntni^arinn of tiie gel;; on a 1ighfl>ox- Quantitation of superoxide dismutase . 
activity is performed by densatometric scanning of die activily gjds ush^ flie appropriate superoxide 
dismutase podtive and negative controls (e.g., various amounts of commerdally avdlable R coH 
superoxide dkmntase (EarOi, G. and Horwitz, MJL (1999) J. Biol Chem. 274:4281-4292). 

CatediolrO-iDetliyttiansferase activity of DMB is measured in a reaction nmxture consisting 

25 of 50 JsM Tris-HQ 7.4), 1.2 nM MgCt. 200 iM SAM (S'-adeaiosyl-L-nirfhionine) iodide 
(CQDtaintng 0.5 /iCi of [n]ethyl-[^H]SAMO, 1 mM dithiotbreftol, and varymg coiicentrations of 
catedK>l substrate (e.g.,L-dopa, dopamine, or DBA) in a final voluim Tbereactionis 
initiated by the addition of 250-500 ng of purified DMB or crude DMB-containing sanqple and 
perfonnedatS? '^forSOmin. The reactnbn is arrested by rapidly cooliiig on ice and irnrnediate^^ 

30 extracting vviili? nil of ic&^oldn-beptaoie. Folbwingceatrifugationat lOOOxgfor lOmirk, 3-inl 
alx^uots of tbe^ organic extracts are analyzed for radioactivity content by liquid sdntiUalion countipg. 
The level of catecbol-assodated radioactivity in the organic phase is proportional to the activity 

no 
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catechol-O-inetJiyltransfease activity of DMB (Zlni, B.T. Liehr, J.G. (1996) 271:1357-1363). 

DHFR activity of DMB is detBrmined spectrophotometiicalty at 15 by fbllowiBg the 
disappearance of NADPH at 340 nm (8340 = 11,800 M'^-cm'^). The standard assay mixture contains 
100 iJiM NADPH, 14 mM 2-mercaptoethanol, MTEN buffer (50 mM 2-morpbolinoetiianesulfamc add, 

5 25 mM tris(hydroxymethyl)aminomethane, 25 mM ethanobmine, and 100 mM Nad, pH 7.0), and 
DMB in a final volume of 2.0 raL The reaction is started by the addition of 50 /iM dihydrofolate (as 
substrate). The oxidation of NADPH to NADF" corresponds to the reduction of dihydrofolate in the 
reaction and is proportional to the amonnt of DHFR activity in Ibe san^le ^akamnra, T. and 
Iwakura, M. (1999) J. BioL Ohcm. 274:19041-19047), 

10 SiilfotransferaseactmtyofDME is raeasured using the 

into a model substrate such as pbenol (Folds, A and Medc, J. L. (1973) Biochim. Bfophys. Acta 
327:365-374). An aliquot of enzyme is incubated at 37 ^'C vnth 1 mL of 10 mM phosphate buffer 
pH 6.4, 50 (iM pheDol, 0-4-4.0 pM p^S]PAPS. After sufekaent time for 5-20% of the radiolabel to 
betraojsfisaiedtolliesabstrate, 0.2inLof 0.1 M barium acetate is added to precipitate protein and 

15 phDq[*ate bofifer. Tben 0.2 mL of 0,1 M Ba(0H)2 is added, foHowedhy 0.2 noL ZnS04. The 
siq>ematant is cleared by cmtrifugatioii, which removes protdns as well as imreacted |j^^S]PAPS. 
Radx)activity in the siqpematant is measured by s 
thennnatb^ofniolesofradioactivity in the reaction prodi^ , 
XVm. Identificalion of DME Inhibitors 

20 Compounds to be tested are arrayed in the wells of a muM-weH plate in varying 

conc^itrati0ns along widi an appropriate ha&cr and substrate, as described in fhe assays in Example 
XVn. DMB activity is measured for each weU and the abilily of each compound to inhibit DME 
activi^ can be d^ermined, as weU as the dose-response profiles . This assay could sdso be used to 
identify molecnlBS which eDhance DME activity. 

25 

Varkius modifications and variations of the desmbed n^diods and systems of the invention 
win be i^parsDt to those skilled in die art wtthont departmg from the scope and spiiit of die mvention. 
Aldiou^ the invention has been described in connectiDn with certain embodiments, it should be 
understood fliat the invention as claimed should not be unduty limited to sudi spedfic embodiments. 
30 hideed, various modifications of the described modes for carrying out the invi^tion which are obvious 
to those skilled in molecnlar biology or related fields are intraided to be within the scope of die 
following claims. 
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What is claimed is: 

1. An isolated {K)lypq)tide selected from the group consisting of: 

a) a polypeptide con^iising an amino add sequence selected £tom &e group consisting of 
5 SBQIDN0:M9, 

b) a polypeptide compnsing a naturalty occurring amino acid sequence at least 90% identical 
to an ammo add sequence selected from die group consisting of SEQ ID NO:l-'19 , 

c) a biological]^ active fragment of a polypeptide having an amino add sequence selected 
fit>m the group comisting of SEQ ID NO:l-19 , and . 

10 d) an immunogenic firagmeut of a polypeptide having an amino acid sequence selected from 

the group consisting of SEQ ID NO:l-19 . 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l-19 . 
15 3. AnisolfEtedpolyD]jdeotideaux>dingapol^^ 

4. Anisolatedpo^acleotideesDcodingapo^peptideof cldm2. 

S« An isolated polynaclBotide of claim 4 sefectedfrxim the gronpcos^ 
20 NO:20-38, 

6. AieconoibinantpolyimcleotidecMii^^ 
pol^cleotide of claim 3. 

25 7, Acelltransfinnned wiQiaiecoiiibinaDtp^^ 

8. A transgenic organism comprising a icconJiiDant polynadeotide of claim 6. 

9. A mefliod of producing a polypeptide of claim 1, the method comprising: 

30 a) cultoring a cell under conditions suitable for expression of the polypeptide, wherein said cell 

is transfomied with a reconabinant polynucleotide, and said reconaibinant polynucleotide comprises a 
promoter sequence operabfy linked to a polynucleotide encoding the polypeptide of claun 1 , and 
b) lecovering file polypeptide so expressed. 
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10. An isolated antibody wfaddi specifically binds to a potypeptide of claim 1. 

11. An isolated polynucleotide selected from the group consistii^ 

a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting 

5 ofSEQIDNO:20-38, 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 
identical to a polynucleotide sequence selected jfrom liie group consisting of SBQ ID NO:20-38, 

c) a polynucleotide con^lementaiy to a polynucleotide of a), 

d) a polynucleotide compkm^itary to a polynucleotide of b), and 
10 e) an RNA equivalent of a>d). 

12. An isolated polyuaclBotidecon]$ttising at least 60 contigqo^^ 
pofymicleotide of claim 11. 

15 13. Ami^liod of delectingataigetiK>lynuclebtideinasai^^ said target polyaacleotide 

having a sequraice of apot^mncleotideof claim 11, tliemefikod compriang: 

a) hybridizii^ flie sample wifli apiobe comprising at least 20 contiguous nudeotides 
c«m?)rirfng a sequence coiiq>lementary to said • 
spedficallyhybridizes to said target polynucleotide, under conditions whereby a hybridization complex 

20 is formed between said probe and said targpt polynucleotide or fragm^ tiieieof , and 

b) detecting flie presence or abseiice of said hybridization coii5ilex,a^ 

the amount thereof . 

14. A mi^hodof claiml3,wh«tdntiiBprobecon5>risesatleast60conti^ 

25 

15. A m^hod of d^ectii^ a target polynndteotide ma sample, said targptpol^ 
haidng a seqoance of a polynucleotide of daim 11 , the mefliod comprising: 

a) ariq>lifymg said target polynucteotide or fragment 
amplification^ and 

30 b) detecting the presence or absCTce of said amplified target polynucleotide or firagment 

tiij^:eof, and, optionally, if present, the amount tisereof. 

16. A conxposition compri^ng a polypeptide of claim 1 and a pharmaceutically acceptable 
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exdpient 

17. A coiiq)osition of claim 16, wheidn flie polypeptide has an amino add sequence selected 
firom the group consisting of SEQ ID N0:l-19 . 

5 

IS. A mediod for treating a disease or condition associated with decreased expression of 
functional DME, comprising administenng to a patiy^ in need of such treatment the composition of 
claim 16. 

10 19. A method of screening a coriqx)und for effcctiviMaess as an ago 

claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sanqile. 

15 20. A composition conoqptising an agonist con^und identified by a mediod of claim 19 and a 

pharmacequtically accqptable exdpiCTt 

21. A mi^od for treating a disease or condition associated with decreased expression of 
functional DME» conqirising administering to a patient in need of such treatment a composition of 

20 claim20. 

22. A ini^hod of scxeening a conqvonnd for effectiveness as an antago^ 
chum I9 tiie me&od comprising: 

a) e:3q»osing a sample compiising a polypeptide of claim 1 to a compound, and 
25 b) 



23. A composition comprising an antagonist compound identijBed by a method of claim 22 and 
a pharmaceutically acceptable exc^ent 

30 24. A m^pd for treating a disease or condition associated with overexpression of functional 

DME, comprising administering to a pati^ in need of such treatment a composition of claim 23 . 

25. A mefliod of screeniiig for a compound Ihatspedficany binds to ihep^ 
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1, the method comprising: 

a) combming the polypqydde of claim 1 with at least one test compoimd imder suitable 
conditiDiis, and 

b) detectmg binding of the polypeptide of claim 1 to the test compoimd, thereby id^tiiying a 
5 compound that specifically binds to die polypeptide of claim 1. 

26. A mediod of scrcedbg for a con^nnd that modulates the activity of &e 
claim 1, the method comprising: 

a) combiningthepolypeptideof claim 1 with at feast one test compoondnnderc^ 
10 permissive for the activity of the pofypeptide of claim 1, 

b) assessing Ihe activity of the polypeptide of daiml in the presence of ti^ 

and 

c) comparingtheacdvityoffliepolypqrttdeof claimlin^ptesenceof the 

with die activity of the polyp^de of claim 1 infiie absenceof the test compound, whmin achange 
15 in die activity of die polypeptide of clahn 1 indie prcsoice of die test compound is indicative of a 
compound diat modulates die activity of the xx>:^peptide of clam 1 . 

■27. A mediod of scieenii]^ a coiiqxKuid for effectiveness in alteri 
polynucleotide, wh^in said target polynucfeotide comprises a sequ^ce of claim 5, the method 
20 conqirising: 

a) exposing a sample compriang die taig^ po^nacleotide to a compound, under conditions 
suitable for die expression of die target polynucteotide, 

b) detecting altered expression oflhe taiget polynucleotide, and 

c) comparingdieeT^ressionofdietaigetpolynucleotideinlheprescnceofvai^^ 
25 of the con^und and in the aibs^ace of the con^und. 

28* A method of assessing toxicity of a test compound, the method comprising: 

a) treating a biological sample containing nucldlc adds with the test con^und, 

b) hybridizing the nucleic adds of the treated biological san^le with a pmbe con^iising at 
30 least 20 contiguous nucleotides of a polynudeotide of claim 1 1 xmder conditions whereby a specific 

hybridization complex is formed between said probe and a target polynucleotide in the biological 
sanq>Ie, said targ^ polynucleotide con^prising a polynucleotide sequence of a polynucleotide of claim 
11 or fitigment thereof. 
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c) qaantifyiiiglb&aaioimt of hybridization coi^ 

d) conq>aiiDg die amount of hybridization complex in die txeated biologtcal sample widi the 
amount of hybridization complex in an untreated biological sample, wh^tein a di^ience in file amount 
of hybridization complex in die treated biological sample is indicatiye of toxicity of flie test compound. 

5 

• 29. A (fiagqosdc test for a condition or disease associated widi the espiesd 
biobgical sample, the mdhod compising: 

a) combining die biological sample vnSi an antibody of claim 10» under conditions suitable for 
die antibody to bind die polypeptide and form an antibody:polypeptide complex* and 
10 h) detectingthecon^les, wherein the presence of thecompkxc^ 

of die polypeptide in die biological sampK 

30. The antibody ofclaimlO»whardn die antibody is: 
a) a cfaim^c antibody^ 
is b) a single chain antibody, 

c) aFabfiragmi^ 

d) aF(ab')2firagment,or 

e) a humanized antibody. 

20 31. A co]iqK>sition comprising an antibody of claim 10 and an acceptable exidpient 

32. A method of diagnosing a condition or disease assodated with the expression of DME in 
a subject, con^rising admioistering to said subject an ejSiBctive amount of die composition of claim 31. 

25 33. AcosG^sitionof c]aim31, whar^dieantibodyislabeled. 

34. A mediod of diagnosing a condition or disease assodated with the espression of DME in 
a subject, compxisiDg sUlvnm\stmng to said subject an ^fective amount of the composition of claim 33. 

30 35. A method ofprqparing a polyclond antibody widi die spedfikd 

10, die mediod comprismg: 

a) imnmnlTang an gfiimal witii a polypeptide having an amino add sequence selected from the 
groupcoifiistmgof SBQIDNO:l-19 ,or anitmmiiiogenicftagnQeintdi£m>f,^^ 
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aa antibody response, 

b) isoladng antibodies fram said ani^^ 

c) scteedDg the isolated antibodies with the polypeptide, 

antibody whidi binds spedficalfy to a pofypeptide having an anuno adid seqaetice selected firom tbe 
5 group consisting of SEQ ID NO:l-19 * 

36. Anantibodlyprodacedbyamediodof claim35. 

37. Aco]npositioncompn5ingtbeantibodyofclaim36andasiut£^ 

10 

38. Amethodofiualdbdigaimnocknalan^ 
10» die mediod con^risipg: 

a) iTTiTnnTiiTiTig an antmaJ wifli a polypq)tide having an amino acid sequence selected from the 
gnmp consistii^ of SBQ ID N0:l-I9 » or an imnnmo^nic fragment thereof, und^ conditions to elidt 

IS an antibody response, 

b) isolating antibody producing cells from the animal, 

c) ftising the antibody producing cells \idthimnK}rta]izedce^ 
producing hybridoma cells, 

d) cultoring the hybridoma cells, and 

20 e) isolating from the culture monoclonal antibody which binds spedfically to a polypeptide 

having an amino acid sequeav^ selected from die group consisting of SEQ ID N0:l-I9 . 

39. AmoBodonalantibodyproducedby a method of claim 38. 

25 40. A composition coniprising d^ antibody of claim 39 and a suitable cani^. 

41. The antibody of claim 10, wherein the antibody is produced by screening a Fab e?q»rei5sion 

Kbiaiy. 

30 42. Ihe antibody ofdaim 10, wherdn the antibody is produced by scre^^^ 

hmmmoglobxifin libraiy. 

43. A method of detecttne a poIvoeDtide bavins an amino ^d sequence selected from die 



139 



wo 02/12467 



PCT/USOl/24382 



groupconsistiDgof S£QIDNO:l-19 in a sample, the method coiiqirising: 

a) incubatingdieantibodyof claimlO with asampbiii^ 
of the antibody and the polypeptide, and 

b) detecting spedfic binding, wbeidnspedficbindi^ 

5 haidng an amino acid sequence selected fiom the group consisting of S£Q ID N0:l-19 in flie sample. 

44. A method ofpaiiFymg a polypeptide ha%^ 

group consisting of SEQ ID NO:l-19 fiom a sample, die method con^ptising: 

a) incabating die antibody ofclaim 10 witti a sampfennd^ conditions to {fl^ 
10 of the antibody and die polypeptide, and 

b) sqiarating die antibody finom die sample and obtaining the pi^^ 
amino add sequence sebcted from the gronp coosistipg of SBQ ID NO:l'19 . 

45. A po][ypcptide of claim 1, conqirising die annuo acid sequence 

15 

46. Apolypeptideofclaiml, compiisingdieaniino add sequence of SBQ I^ 

47. A polypeptide of claim 1, conqirisit^ the amino add sequtaice of SEQ ID N0:3. 
20 48. A polypeptide ofc]aiml,conq>]istng die amino add sequence of SEQ ID NO:4. 

49. Apolypeptideof claim 1» comprising die amino add sequ^ce of SBQ ID N(>.5. 

50. A polypeptide of claim 1, conqinsing the amino add sequ^ce of SBQ ID NO:6. 

25 

51. A polypeptide of claim 1, comprising the amino add sequ^oce of SBQ ID NO:7. 

52. Apolypqitideof claim l,conq)rising die anuno add sequence of SEQ n>N0:8. 
30 53. Apo^pq>tidBofclaiml»co]iqHisingdieaminoadd 

54. Apo^peptideofclaiml,con4xrisit^theaminoaddsequ€ai^ 
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10 



55. A polypeptide of claim lyCQtiqnisiiigflieaiiim^ 

56. A po^peptide of claim 1, coiiq>rising the amino add sequ^ice of SEQ JD NO:12. 

57. A polypqptide of claim 1, conqinsjiig the amino add seqaence of SEQ ID NO:13. 

58. A polypeptide of claim l,coii^iisingflieamii» 

59. A polypeptide ofclaiml,conqni^iig die amiiiD add seqa^ice of SBQ 

60. A polypeptide of claim 1 , comprising flie amipo add seqaence of SEQ ID NO:16. 

61. A polyp^itide of claim 1, con^irising the amino add sequence of SEQ ID N0:17. 
15 62. A polypeptide ofdaimlycon^xrisiiig die amino add seq^ 

63. A polypeptide of claim 1, comprising tiie amino add sequence of SEQ ID NO:19. 

64. A pofyuucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:20. 

20 

65. A pofynucleotide of claim 11, composing the polynucleotide sequence of SEQ ID N0:21. 

66. A po^ucleotide of claim 1 1, comprising die polynucleotide sequence of S£Q ID NO:22. 
25 67. A polynucleotide of claim 11, comprising die polynucleotide sequence of SEQ ID N023. 

68. A polyimclBotide of claim 1 1, comprising the polynucleotide sequaoce of SEQ ID NO:24. 

69. A po^ucleotide of claim 11, conoiifising die polynucleotide sequ^ice of SEQ ID NO:25. 

70. A polynucleotide of daim 1 1, compising die polymideotide sequ^tice of SEQ ID NO:26. 

71. A pofymicleotide of cl^m 11, comprising d^ 
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72. A polynaclcotidc of claim 11, compiisuiig die polynucleotide scqaesice of SEQ ID NO:28. 

73. A polynucleotide of claun 11, comprismg fte polytmcfeotide sequence of SEQ ID NO:29. 

74. Apolynuckotideof claim II, coniprismg the polynucleotide seqaoDce of SEQ ID NO:30. 

75. Apol^cleotideof claim 11, comprising die polynucleotide sequeiiGe of SBQI^ 

76. A polynucleotide of claim 11, conq>rising the polynucleotide sequence of SEQ ID NC>:32. 

77. A *pol3^c]eotide of claim 11, comprising the polynnckotide sequence of SEQID NO:33. 

78. A polynucleotide of datm 11, comprising the fwlynucleotide sequence of SEQ ID NO:34. 

79. Apo^nncfeotideof dlaimll,conqiristngtiiepol^ 

80. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:36. 

8 1 . A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID NO:37. 

82. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:38. 
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<110> INCXTE GENOMICS, INC. 
BAXJBHN, Mariah R. 
BRUNS, Chris tofph.er M- 
DAS, Debopriya 
DELBGEANE, Angelo H. 
DING, Li 
ELLIOT, Vicki S. 
GANDHI, Ameena R. 
GBXFFIN, Jennifer A- 
HAFALIA, April J-A. 
KHAN, Farrah A. 
LAL, Preeti 
LEE, Sally 

LU, Dyung Aina M. * , 

LU, Yan 

PATTERSON, Chandra 
RAMKUMAR, Jayalaxmi 
RING, Huijun Z. 
SANJANWALA, Hadhn S. 
TANG, Y. Tom 
THANQAVELU, Kavitha 
THORNTON, Michael 
TRIBOXJLEY, Catherine M. 
WALIA, Narinder K- 
WARREN, Bridget A. 
YANG, Junxning 
YAO, Monique G, 
YUE, Henry 

<120> DRUG METABOLIZINQ ENZYMES 

<130> PI-0185 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/223,055; 60/224,728; 60/226,440; 60/228,067; 60/230,063; 60/232.244; 
60/234,269 

<151> 2000-08-04; 2000-08-11; 2000-08-18; 2000-08-24; 2000-08-31; 2000-09-13; 
2000-09-20 

<160> 38 

<170> PERL Program 

<210> 1 

<211> 756 

<212> PRT 

<213> Hcino s£qpiens 

<220> 

^ <221> inisc_feature 

<223> Incyte ID No: 7248285CD1 



<400> 1 






Met Ala Trp Ser Pro 


Pro Ala Thr Leu Phe Leu 


Phe Leu Leu Leu 


1 5 


10 


15 


Leu Gly Gin Pro Pro 


Pro Ser Arg Pro Gin Ser 


Leu Gly Thr Thr 


20 


25 


30 


Lys Leu Arg Leu Val 


Gly Pro Glu Ser Lys Pro 


Glu Glu Gly Arg 


35 


40 


45 


Leu Glu Val Leu His 


Gin Gly Gin Trp Gly Thr 


Val Cys Asp Asp 


50 


55 


60 
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Aoll 








65 




VarXU 


AXCi 


Ala 


Leu 
80 


\3±.y 




wJ-y 


Pro 


J. J.6 

95 




Ser 


oer 


Leu 


Asp 
110 


Asp 


Cys 


Ser 


rlJLS 


Ser 
125 


Arg 




Arg 


uj.y 


Tyr 
140 


Pro 




Gly 


Glu 


Arg 
155 


X T «k 


Ser 


Ala 


Lys 


oin 
170 


vaj. 


Lys 


Tyr 


Glu 


ijiy 
185 


Thr 


Met: 


Asn 


Asn 


Ser 
200 


Ser 


Glu 


Val 


±rro 


val 
215 


Leu 


Lys 


Met: 


Arg 


Asp 
230 


Lys 


Asu 


Ser 


Dim 

File 


Trp 
245 


TV——. 

irro 


£U.S 


nec 


i\-La 


Asn 
260 


Lys 


Leu 


Arg 


Pro 


Ala 
275 


cys 


VaJ. 


Ala 


Gly 


Pro 
290 


Arg" 


Lys 


Gly 


Ser 


Trp 
305 


Gly 


Ala 


Gin 


Val 


Gly 
320 


Gin 


Trp 


Gly 


Thr 


Val 
335 


Ser 


Val 


val 


Cys 


Arg 
350 


J-»©U. 


trlLe 


i3±y 


AXa. 


Arg 
365 




l^XU 


vai 


Arg 


Cys 

380 


Pro 


AJia 


Leu 




vjj.y 
395 








Arg 


Cys 
410 


VCUL 


&v-rr 


Leu 


Ala 


v»a.y 

425 


VgUL 


VJXTl 


VCLL 


04,11. 


vajL 
440 


Ser 


Glu 


Asn 


Trp 


Gly 
455 


Leu 


Gly 


Leu 


Gly 


Phe 
470 


Trp 


Ser 


Gly 


Thr 


Pro 
485 


Arg 


Cys 


Ser 


Gly 


Thr 
500 


Gly 


Pro 


Val 


His 


Cys 
515 



Glu 


Ala 


Thr 


Val 


Ala 
70 






Aia 


His 


Ser 
85 




Leu 


Asp 




Val 
veuu 

100 




Cys 


vT-iy 


Ser 


Asn 
115 




Asp 


vax 




Val 

vax 
130 


Leu 


Ser 


wJLU 


xnr 


V=»l 

vax 
145 


Leu 


Glu 


(aXU 


vax 


Arg 
160 


ms 


Ser 


Jrro 


v«.l 
val 


xnr 
175 


xilS 


Trp 


Arg 


i»j.n 


vax 
190 


Arg 


vaj. 


va± 


Cys 


v»xy 
205 


Asp 


Ser 


HXS 


Tyr 


Tyr 
220 


jrTO 


Lys 


Ser 


Arg 


Leu 
235 


Tl a 

jLxe 






Vs»l 


Thr 
250 


Cys 




vai 


vixn 


vax 
265 


Cys 


Pro 


v»xy 


laxy 


280 


ms 


"Pino 

jrne 


Arg 


rTO 


Pro 
295 


AJLSL 


urXU 


uXU 


±TO 


Arg 
310 






Arg 


vax 


VvXU 

325 


vjys 


Asp 


ItlS 


Arg 


Trp 
340 




Leu 


v>±y 


xrjj.e 


oxy 
355 


Leu 


fZl IT 

VsrXy 


vrxn 


wxy 


Leu 
370 






Tyr 




Arg 
385 








^axy 


Cys 
400 


Asn 


Val 


Pro 


Asn 


Met: 
415 


Gly 


Artr 


lie 


Pro 


Glu 
430 


Asn 


Qlv 


Val 


Pro 


Arg 
445 


Leu 


Thr 


Glu 


Ala 


Met 

460 


Ala 


11b 


His 


Ala 


Tyr 
475 


Arg 


Ala 


Gin 


Glu 


Val 
490 


Glu 


Leu 


Ala 


Leu 


Gin 
505 


Ser 


Hie 


Gly 


Gly 


Qly 
520 
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Oln 


Leu 


Gly 
75 


A1 a 


Lys 


Tyr 


wxy 


uXu 

90 


Arg 


V»yE> 


V»1 


ijxy 


ilUX 

105 


oxy 


Trp 


v»xy 


Val 

vax 


oer 

1 on 
xzu 


xxe 


cys 


nxs 


Pro 


Arg 
135 


Ser 


Asn 


Ala 


Leu 


Gly 
150 


Leu 


Lys 


Pro 


lie 


Leu 
165 


Glu 


Gly 


Ala 


Val 


Glu 
180 


Cys 


Asp 


Gin 


Gly 


Trp 
195 


Mec 


Leu 


Gly 


Pne 


Pro 










210 


Arg 


jjys 


vai 


Trp 


Asp 
225 


iiys 


Ser 


Leu 


Thr 


Asn 
240 


Cys 


Leu 


Gly 


Thr 


Glu 

255 


Ala 


Pro 


Ala 


Arg 


Gly 
270 


xilS 


Ala 


Val 


Aral 

vai 


Ser 
285 


Lys 


Thr 


Lys 


Pro 


Gin 
300 


Val 


Arg 


Leu 


Arg 


Ser 
315 


Val 


Leu 


Met 


Asn 


Arg 
330 


Asn 


Leu 


lie 


S^r 


Ala 
345 


Ser 


Ala 


Arg 


Glu 


Ala 
360 


Gly 


Pro 


lie 


IT 4 M 

uxs 


Leu 
375 


Thr 


T BS^ 

liEU 


Ser 


Asp 


Cys 

390 


isxn 


ZlXS 


(sXU 


Asn 


Asp 
405 






VJ-i. tt 


Asn 


420 


uXU 


nlxr 

wjLy 


X^eu 


Lsu 


nin 
435 


Trp 


wxy 


Ser 


Vnl 

vax 


Cys 
450 


Val 


Ala 


Cys 


Arg 


Gin 
465 


Lys 


Glu 


Thr 


Trp 


Phe 
480 


Val 


Met 


Ser 


Gly 


Val 
495 


Gin 


Cys 


Gin 


Arg 


His 
510 


Arg 


Phe 


Leu 


Ala 


Gly 
525 
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Vol Ser Cys Met Asp Ser Ala Pro Asp Leu Val Met Asn Ala Gin 

530 535 540 

Leu Val Gin Glu 'Ehr Ala Tyr Leu Glu Asp Arg Pro Lou Sor Gin 

545 550 555 

Leu Tyr Cys Ala His Glu Glu Asn Cys Leu Ser Lys Ser Ala Asp 

560 565 570 

His Met Asp Trp Pro Tyr Gly Tyr Arg Arg Leu Leu Arg Phe Ser 

575 580 585 

Tbr Gin lie Tyr Asn Leu Gly Arg Thr Asp Phe Arg Pro Lys Thr 

590 595 600 

Gly Arg Asp Ser Trp Val Trp His Gin Cys His Arg His lyr His 

605 610 615 

Ser lie Glu Val Phe Thr His Tyr Asp Leu Leu Thr Leu Asn Gly 

620 625 630 

Ser Lys Val Ala Qlu Gly His Lys Ala Ser Phe Cys Leu Glu Asp 

635 640 645 

Thr Asn cys Pro Thr Gly Leu Gin Arg Arg Tyr Ala Cys Ala Asn 

650 655 660 

Phe Gly Glu Gin Gly Val Thr Val Gly Cys Trp Asp Thr Tyr Arg 

665 670 675 

His Asp lid Asp Cys Gin Trp VelL Asp He Thr Asp Val Gly Pro 

680 685 690 

Gly Asn Tyr He Phe Gin Val He Val Asn Pro His Tyr Glu Val 

695 700 705 

Ala Glu Ser Asp Phe Ser Asn Asn Met Leu Gin Cys Arg Cys Lys 

710 715 720 

Tyr Asp Gly His Arg Val Txp Leta His Asn Cys His Thr Gly Asn 

725 730 735 

Sor Tyr Pro Ala Asn Ala Glu Leu Ser Leu Glu Gin Glu Gin Arg 

740 745 750 

Leu Arg Asn Asn Leu He 

755 

<210> 2 

<211> 544 

<212> PRT 

<213> Hcaoo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: 7472835CD1 

<400> 2 

Met Ala Lys Lys Ala He Ala Val He Gly Ala Gly Tie Ser Gly 
15 10 15 

Leu Gly Ala He Lys Cys Cys Leu Asp Glu Asp Leu Glu Pro Thr 

20 25 30 

Cys Phe Glu Arg Asn Asp Asp He Gly His Leu Trp Lys Phe Gin 

35 40 45 

Lys Asn Thr Ser Glu Lys Met Pro Ser He Tyr Lys Ser Val Thr 

50 55 60 

He Asn Thr Ser Lys Glu Met Met Cys Phe Ser Asp Phe Pro Val 

65 70 75 

Pro Asp His Phe Pro Asn Tyr Met His Asn Ser Lys Leu Met Asp 

80 85 -90 

Tyr Phe Gly Met Tyr Ala Thr His Phe Gly Leu Leu Asn Tyr He 

95 100 105 

Arg Phe Lys Thr Glu Vail Gin Ser Val Arg Lys His Pro Asp Phe 
110 115 120 

Ser He Asn Gly Gin Trp Asp Val Val Val Glu Thr Glu Glu Lys 
125 130 135 

Gin Glu Thr Leu Val Phe Asp Gly Val Leu Val Cys Ser Gly His 
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140 145 • 150 

His Thr Asp Pro Tyr Leu Pro Leu Gin Ser Ph© Pro Gly Met Glu 

155 160 165 

Lys Phe Glu Qly Cys Tyr Phe His Ser Arg Glu Tyr Lys Ser Pro 

170 175 180 

Glu Asp Phe Ser Gly Lys Arg He He Val He Gly He Gly Asn 

185 190 195 

Ser Gly Val Asp He Ala Val Glu Leu Ser Arg Val Ala Lys Gin 

200 205 210 

Val He Phe Leu Ser Thr Arg Arg Gly Ser Trp He Leu His Arg 

215 220 225 

Val Tip Asp Asn Gly Tyr Pro Met Asp Ser Ser Phe Phe Thr Arg 

230 235 240 

Phe Asn Ser Phe Leu Gin Lye He Leu Thr Olir Pro Gin He Asn 

245 250 255 

Asn Gin Leu Glu Lys He Met Asn Ser Arg Phe Asn His Ala His 

260 265 270 

Cys Gly Leu Gin Pro Gin His Arg Ala Leu Ser Gin His Pro Thr 

275 280 285 

Val Ser Asp Asp Leu Pro Asn His He He Ser Gly Lys Val Gin 

290 295 300 

Val Lys Pro Ser Val Lys Glu Phe Thr Glu ^Chr Asp Ala He Phe 

305 310 315' 

Glu Asp Ser Thr Val Glu Glu Asn He Asp Val Val He Phe Ala 

320 325 330 

Thr Gly Tyr Ser Phe Ser Phe Ser Phe Leu Asp Gly Leu He Lys 

335 340 345 

Val Thr Asn Asn Glu Val Ser Leu Tyr Lys Leu Met Phe Pro Pro 

350 355 360 

Asp Leu Glu Lys Pro Thr Leu Ala Val He Qly Leu He Gin Pro 

365 370 375 

Leu Gly He He Leu Pro He Ala Glu Leu Gin Ser Arg Trp Ala 

380 385 390 

Thr Arg Val Phe Lys Gly Leu He Lys Leu Pro Ser Ala Glu Asn 

395 400 405 

Met Met Ala Asp He Ala Gin Arg Lys Arg Ala Met Glu Lys Arg 

410 415 420 

Tyr Val Lys Thr Pro Arg His Thr He Gin Val Asp His He Glu 

425 430 435 

Tyr Met Asp Glu He Ala Met Pro Ala Gly Val Lys Pro Asn Leu 

440 445 450 

Leu Phe Leu Phe Leu Ser Asp Pro Lys Leu Ala Met Glu Val Phe 

455 460 465 

Phe Gly Pro Cys Thr Pro Tyr Gin Tyr His Leu His Gly Pro Glu 

470 475 480 

Lys Trp Asp Gly Ala Arg Arg Ala Asn Leu Thr Gin Arg Glu Arg 

485 490 495 

He He Lys Pro Leu Arg Thr Arg He Thr Ser Glu Asp Ser His 
500 505 510 

Pro Ser S&r Gin Leu Ser Trp He Lys Met Ala Pro Val Ser Leu 
515 520 525 

Ala Phe Leu Ala Ala Gly Leu Ala Tyr Pl^® Arg Tyr Thr Pro Tyr 
530 535 540 

Qly Lys Trp Lys 



<210> 3 

<211> 501 

<212> PRT 

<213> Homo sapiaos 



<220> 
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<221> iiiisc_f eatrure 

<223> lacyte ID No: 7476203CD1 

<400> 3 



Met 


Leu 


Ser 


Leu 


Leu 


Ser 


Gly 


Leu Ala Leu Leu Ala He Ser Phe 


1 








5 






10 15 


Leu 


Leu 


Leu 


Lys 


Leu 
20 


Glv 


Thr 


Phe Cys Tirp Asp Ar^ Ser Cys Leu 
25 30 


Pro 


Paro 


Glv 


Pro 


Leu 
35 


Pro 


Phe 


Pro Xle Leu Glv Aan Leu Tm Glxi 
40 45 


Xieii 




Phe 


Gin 


Gin 
50 


Pro 


His 


Tjeu Sei^ TiSii IiVS Asn PHe GT_ti TiVS 

55 60 




He 


Glv 


Asn 


He 
65 


Phe 


Met 


Asn Ti^i Glv Setr Ser* Val. Val PT*f> 

fUSAA UCcUL VrJbJ^ ■JwSA. ■aSP^iL VCl-L VCl^ £^iLV^ 

70 75 




Ala 


Leu 


Pro 


Leu 
80 


X>6U 


Pro 


Val Thv Pile Hi«5 Pt*o Timi Acsti Rln 
85 90 


Glv 


Val 


Leu 


Cys 


Lys 
95 


Pro 


Leu 


Tie TTiv PKe Pirci Iats P^o Phe Piro 
100 105 






Asn 


Pro 


Glv 
110 


He 


He 


Cvs Sgi^ Ser* Glv Hi ^ THr Tm Atct 
115 120 


Gin 


Lys 






Phe 
125 


cys 


Leu 


Val Met Tla Attt Glv Leu GTv Leni. 

VMi^ jnatv jn^^ \^jmjf ^7*A.jf 

130 135 


Glv 


Lys 


Leu 


Ala 


Leu 
140 


Glu 


Val 


GIti lieii Glti IjVs Glu Ala. Ala Glu 

145 150 


Lou 


Ala 


Glu 


Ala 


Pho 
155 


Arcf 


Gin 


Glti Gin Glv Lvs AircT Pro Ph© Asn 
160 165 


Piro 


Glu 


Val 


Ser* 


lie 
170 


Val 


Argf 


Ser Thr* Val Artr Val Tie Glv Ala. 

175 180 


Leu 


Val 


Phe 


Glv 


Hie 
185 


His 


Phe 


l^eii lieu Glu Aen Pi?o Tie Phe Gin 

XJ^7U ^7JkU J^J^^^ JkJL^7 JC^AJl*^ \J.t 1 1 

190 195 


Glu 


Leu 




Gin 


Ala 
200 


He 


Asp 


Pli^ filv TmPfii A1p9 VHo Vjr»l Q^T" T^T" 
205 210 


Val 






Gin 


Leu 
215 




Asp 


Val Pho PT*r\ 'Pm Ala Tipni Ovfi TTt r 

220 225 


L6U 


P3ro 


Glv 


Pro 


HXS 
230 


Gin 


Glu 


Tl*a Php Arcf Tvr Gl n Gl u Val Val 
235 240 


Leu 


Se3r 


Leu 


He 


His 
245 


Gin 


Glu 


Tie Thr Arcr Hla Lvs Leu Arcr Ala. 

250 255 


Pro 


Glu 


Ala 


Pro 


Arg" 
260 


Asp 


Phe 


Tie Ser Cys ^I^^r Leu Ala Qln Tie 
265 270 


S&r 


Lvs 


Ala 


Met 


Asp 
275 


ASD 


Pro 


Val Ser t^Kt* pL© Asn Gin Glu Asn 
280 285 


Leu 


Val 


Gin 


val 


Val 
290 


He 


Asn 


Leu Phe Leu Gly Gly Thr Asp Thr 
295 300 




Ala 


rChr 


Tlar 


Leu 
305 


Cys 


Tro 


Ala Leu Tie' His Met lie Gin His 
310 315 


Gly 


Ala 


Val 


Gin 


Glu 
320 


Thr 


Val 


Gin Leu Glu Leu Asp Glu Val Leu 
325 330 


Gly 


Ala 


Ala 


Pro 


Val 
335 


Val 


cys 


Tyrr Glu Asp Arg Lys Arg Leu Pro 
340 345 


Tyr 


Thr 


Met 


Ala 


Val 
350 


Leu 


His 


Asp Val Gin Arg Leu Ser Ser Val 
355 360 


Met 


Ala 


Met 


Gly 


Ala 
365 


Val 


Arg 


Gin Cys Val Thr Ser Thr Arg Val 
370 375 


cys 


Ser 


Tyx 


Pro 


Val 
380 


Ser 


Lys 


Gly Thr He Xle Leu Pro Asn Leu 
385 ^ 390 


Ala 


Ser 


Val 


Leu 


Tyr 
395 


Asp 


Pro 


Glu Cys Trp Glu Thr' Pro Arg Gin 
400 405 


Phe 


Asn 


Pro 


Gly 


His 
410 


Phe 


Ser 


Asp Lys Asp Gly Asn Phe Val Ala 
415 420 


Asn 


Glu 


Ala 


Phe 


Leu 
425 


Pro 


Phe 


Ser Ala Gly Thr Arg Val Tyr Pro 
430 435 
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Ala Asp 


Gin Leu 


Ala 


Gin Met Glu 


Leu 






440 






Leu Leu 


Arg Thr 


Phe 


Arg Phe Gin 


Leu 






455 






Leu Lys 


Leu Glu 


Tyr 


He Phe Gly 


Gly 






470 






Gin Glu 


He Cys 


Ala 


Val Pro Arg 


Leu 






485 






Pro Arg 


Glu Asp 


Gly 


Leu 








500 







<210> 4 

<211> 345 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 74785830)1 

<400> 4 



Met 


Lys 


Ala 


Ala 


Val 


Trp 


Tyr 


Gly 


Gin 


1 

X 


















Glu 


Arg 


Glu 


Pro 


Lys 
20 


Glu 


Leu 


Gin 


Asp 


Val 


Ser 


Trp 


Ala 


Gly 
35 


He 


Cys 


Gly 


Thr 


Glu 


Gly 


Pro 


He 


Phe 
50 


He 


Ser 


Thr 


Glu 


Gly 


Gin 


Lys 


Ala 


Pro 
65 


Val 


Thr 


Leu 


Gly 


Val 


Glu 


Glu 


Thr 


Gly 
80 


Ser 


Gin 


Val 


Thr 


Arg 


Val 


Val 


Val 


Asn 
95 


Pro 


Thr 


Val 


Ser 


Asn 


He 


Asp 


Leu 


Tyr 
110 


Asp 


Gly 


Tyr 


Ser 


Asp 


Gly 


Gly 


Phe 


Ala 
125 


Glu 


Phe 


Thr 


Asn 


Tyr 


Lys 


Leu 


Pro 


Asp 
140 


Asn 


Val 


Ser 


Asp 


Glu 


Pro 


Thr 


Ala 


Val 
155 


Ala 


Val 


Gin 


Ala 


Leu 


Phe 


Gly 


Asp 


Thr 
170 


Val 


Ala 


He 


Phe 


Leu 


Leu 


Thr 


Val 


Val 
185 


Ala 


Ala 


Lys 


Ala 


Phe 


Val 


Phe 


Asp 


Leu 
200 


Ser 


Glu 


Glu 


Arg 


Leu 


Gly 


Ala 


Thr 


His 
215 


Ala 


He 


Asn 


Ser 


Asp 


Val 


He 


Asn 


Glu 
230 


Tyr 


Thr 


Glu 


Asn 


Glu 


Val 


Ala 


Gly 


Val 
245 


Ala 


Pro 


Thr 


Leu 


Hbx 


Lys 


Ala 


Arg 


Gly 
260 


Olir 


Val 


Val 


He 


Pro 


He 


Glu 


Trp 


Asn 
275 


Pro 


Met 


Gin 


Leu 


Leu 


Ttxr 


Ser 


Thr 


He 
290 


Ala 


Tyr 


Thr 


Pro 


He 


Asp 


Leu 


He 


Asn 


Glu 


Gly 


Asn 


Leu 
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Phe 


Leu 


Met 


Phe 


Ala 


Thr 


445 










450 


Pro 


Glu 


Gly 


Ser 


Pro 


Gly 


460 










465 


Thr 


Trp 


Gin 


Pro 


Gin 


Pro 


475 










480 


Ser 


Ser 


Pro 


Ser 


Pro 


Gly 


490 










495 


Lys 


Asp 


Val 


Arg 


Val 


Glu 


10 










IS 


Asn 


Glu 


Val 


Lys 


Val 


Lys 


25 










30 


Asp 


Leu 


His 


Glu 




Leu. 


40 










45 


Lys 


Pro 


Asp 


Pro 


Ph© 


Leu 


55 










50 


His 


Glu 


Phe 


Ala 


Glv 


Val 


70 










75 


Lys 


Phe 


Asn 




Gly 


Asp 


85 










90 




Arg 


Glu 


Lys 


Glu 


Glu 


100 










105 


Phe 


He 


Glv 


Leu 


Gly 


Ser 


115 










120 


Ala 


Pro 


Glu 


Glu 


Asn 


Veil 


13& 










135 


Lys 


Glu 


Glv 


Ala 


Leu 


Val 


145 










150 


He 


Lvs 


Glu 


Gly 


Glu 


Val 


160 










165 


Glv 


Ala 


Glv 


Pro 


He 


Glv 


175 










180 


Ala 


Glv 


Ala 


Ser 


Lys 


He 


190 










195 


Leu 


Ser 


Lva 


Ala 


Lvs 


Ala 


205 










2X0 


Gly 


Lys 


Thr 


Asp 


Pro 


Val 


220 










225 


Gly 


Val 


Asp 


Val 


Ser 


Phe 


235 










240 


Lys 


Ser 


Ser 


He 


Asp 


Val 


250 










255 


Val 


Ser 


He 


Phe 


Gly 


His 


265 










270 


Thr 


Asn 


Thr 


Gly 


Val 


Lys 


280 










285 


Thr 


Thr 


Phe 


Gin 


Oln 


Thr 


295 










300 


Asn 


Val 


Lys 


Asp 


Val 


Val 
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305 310 315 

Thx Asp Glu He Glu Leu Glu Asn He Val Glu Ser Gly Phe Glu 

320 325 330 

Gin Leu Val Asn Asp Lys Ser Gin Ala Lys He Leu Xle Lys Leu 

335 340 345 



<210> 5 

<211> 361 

<212> PM 

<213> HamD sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 7478585Ca)l 

<400> 5 



Met 


Ser 


Ala 


Gin 


Phe 


Glu 


Asn 


Val 


Gin Asn 


Pro 


Ser 


He 


Thr 


Arg 


1 








5 








10 










15 


Glu 


Asp 


Val 


Ala 


Glu 
20 


Val 


Leu 


Val 


Ser Val 
25 


Leu 


Thr 


Asp 


Glu 


Thr 
30 


Leu 


Gin 


Val 


Val 


Leu 
35 


Ala 


Lys 


Arg 


Pro Gin 
40 


Ser 


He 


Pro 


Gin 


Asp 
45 


Asp 


Val 


Phe 


Arg 


Phe 
50 


Glu 


Thr 


He 


Glu Thr 
55 


Arg 


Glu 


Pro 


His 


Ala 
60 


Gly 


Qlu 


Val 


Gin 


Val 
65 


Glu 


Ser 


He 


Tyr Val 
70 


Ser 


Val 


Asp 


Pro 


Tyr 

75 


Met 


Arg 


Gly 


Arg 


Met 
80 


Asn 


Asp 


Thr 


Lys Ser 
85 


Tyr 


Val 


Gin 


Pro 


Phe 
90 


Gin 


Val 


Asn 


Glu 


Pro 
95 


Leu 


Gin 


Gly 


His He 
100 


Val 


Gly 


Lys 


Val 


Thr 
105 


Gin 


Ser 


Asn 


Asp 


Glu 
110 


Arg 


Leu 


Ser 


Val Gly 
115 


Asp 


Tyr 


Val 


Thr 


Gly 
120 


He 


Leu 


Pro 


Trp 


Lys 
125 


Lys 


He 


Asn 


Thr Val 
130 


Asn 


Gly 


Asp 


Asp 


Val 
135 


Thr 


Pro 


Val 


Pro 


Ser 
140 


Lys 


Asp 


Val 


Pro Leu 
145 


His 


Leu 


Tyr 


Leu 


Ser 
150 


Val 


Leu 


Gly 


Met 


Pro 
155 


Gly 


Met 


Thr 


Ala Tyr 
160 


Thr 


Gly 


Leu 


Leu 


Gin 
165 


He 


Gly 


Gin 


Pro 


Gin 
170 


Ser 


Gly 


Glu 


Thr Val 
175 


Val 


Val 


Ser 


Ala 


Ala 
180 


Ser 


Gly 


Ala 


Val 


Gly 
185 


Ser 


Val 


Val 


Gly Qln 
190 


He 


Ala 


Lys 


He 


Lys 
195 


Gly 


Ala 


Lys 


Val 


Val 
200 


Gly 


He 


Ala 


Oly Qly 
205 


Lys 


Gin 


Lys 


Thr 


Thr 
210 


Tyr 


Leu 


T!hr 


Asp 


Glu 
215 


Leu 


Gly 


Phe 


Asp Ala 
220 


Ala 


He 


Asp 


Tyr 


Lys 
225 


Gin 


Asp 


Asp 


Phe 


Ala 
230 


Gin 


Gin 


Leu 


Glu Ala 
235 


Ala 


Val 


Pro 


Asp 


Gly 
240 


He 


Asp 


Val 


Tyr 


Phe 
245 


Glu 


Asn 


Val 


Gly Gly 
250 


Val 


He 


Ser 


Asp 


Glu 

255 


Val 


Phe 


Lys 


His 


Leu 
260 


Asn 


Arg 


Phe 


Ala Arg 
265 


Val 


Pro 


Val 


Cys 


Gly 
270 


Ala 


He 


Ser 


Ala 


Tyr 
275 


Asn 


Asn 


Glu 


Lys Asp 
280 


Asp 


He 


Gly 


Pro 


Arg 
285 


He 


Gin 


Gly 


Thr 


Leu 
290 


He 


Lys 


Asn 


Gin Ala 
295 


Leu 


Met 


Gin 


Gly 


Phe 
300 


Val 


Val 


Ala 


Gin 


Phe 
305 


Ala 


Asp 


His 


Phe Lys 
310 


Glu 


Ala 


Ser 


Glu 


Gin 
315 


Leu 


Ala 


Gin 


Trp 


Val 
320 


Ser 


Glu 


Gly 


Lys He 
325 


Lys 


Phe 


Glu 


Val 


Ihr 
330 


He 


Asp 


Glu 


Gly 


Phe 


Asp 


Asn 


Leu 


Pro Ser 


Ala 


Phe 


Arg 


Lys 


Leu 
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335 



340 



345 



Phe Thr Gly Asp Asn Phe Gly Lys Gin Val Val Lys lie Lys Glu 
350 355 360 

Glu 



<210> 6 

<211> 499 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_f eature 

<223> Incyte ID No: 7479904CD1 

<4D0> 6 

Met Asp Glu Lys Ser Asn Lys Leu Leu Leu Ala Leu Val Met Leu 
1 5 10 15 

Phe Leu Phe Ala Val He Val Leu Gin Tyr Val Cys Pro Gly Thr 
20 25 30 

Glu Cys Gin Leu Leu Arg Leu Gin Ala Phe Ser Ser Pro Val Pro 
35 40 45 

Asp Pro Tyr Arg Ser Olu Asp Glu Ser Ser Ala Arg Phe Val Pro 
• 50 55 60 

Arg Tyr Asn Phe Hir Arg Gly Asp Leu Leu Arg Lys Val Asp Phe 
65 70 75 

Asp Ho Lys Gly Asp Asp Leu lie Val Phe Leu His lie Gin Lys 
80 85 90 

Ihr Gly Gly Thr Thr Phe Gly Arg His Leu Val Arg Asn He Gin 
95 100 105 

Leu Glu Gin Pro Cys Glu Cys Arg Val Gly Gin Lys Lys Cys Thr 
110 115 120 

Cys His Arg Pro Gly Lys Arg Glu Thr Trp Leoi Phe Ser Arg Phe 
125 130 135 

Ser Thr Gly Trp Ser Cys Gly Leu His Ala Asp Trp Thr Glu Leu 
140 • 145 150 

Thr Ser Cys Val Pro Ser Val Val Asp Gly Lys Arg Asp Ala Arg 
155 160 165 

Leu Arg Pro Ser Arg Trp Arg He Phe Gin He Leu Asp Ala Ala 
170 175 180 

Ser Lys Asp Lys Arg Gly Ser Pro Asn Thr Asn Ala Gly Ala Asn 
185 190 195 

Ser Pro Ser Ser Thr Lys Thr Arg Asn Thr Ser Lys Ser Gly Lys 
200 205 210 

Asn Phe His Tyr He Thr He Leu Arg Asp Pro Val Ser Arg Tyr 
215 220 225 

Leu Ser Glu Trp Arg His Val Gin Arg Gly Ala Thr Tcp Lys Ala 
230 235 240 

Ser Leu His Val Cys Asp Gly Arg Pro Pro Thr Ser Glu Glu Leu 
245 250 255 

Pro Ser Cys Tyr Thr Gly Asp Asp Trp Ser Gly Cys Pro Leu Lys 
260 265 270 

Glu Phe Met Asp Cys Pro Tyr Asn Leu Ala Asn Asn Arg Gin Val 
275 280 285 

Arg Met Leu Ser Asp Leu Thr Leu Val Gly Cys Tyr Asn Leu Ser 
290 295 300 

Val Met Pro Glu Lye Gin Arg Asn Lys Val Leu Leu Glu Ser Ala 
305 310 315 

Lys Ser Asn Leu Lys His Met Ala Phe Phe Gly Leu Thr Glu Phe 
320 325 330 

Gin Arg Lys Thr Gin Tyr Lou Phe Glu Lys Thr Phe Asn Met Asn 



335 



340 



345 
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Phe lie Ser Pro Phe Thr Gin Tyr Asn Thr Thr Arg Ala Ser Ser 

350 355 360 

Val Olu He Asn Glu Glu He Gin Lys Arg He Glu Gly Leu Asn 

365 370 375 

Phe Leu Asp Met Glu Leu Tyr Ser Tyr Ala Lys Asp Leu Pile Leu 

380 385 390 

Gin Arg Tyr Gin Phe Met Arg Gin Lys Glu His Gin Glu Ala Arg 

395 400 405 

Arg Lys Arg Gin Glu Gin Arg Lys Phe Leu Lys Qly Arg Leu Leu 

410 415 420 

Gin Thr His Phe Gin Ser Gin Gly Gin Gly Gin Ser Gin Asn Pro 

425 430 435 

Asn Gin Asn Gin Sor Gin Asn Pro Asn Pro Asn Ala Asn Gin Asn 

440 445 450 

Leu Thr Gin Asn Leu Met Gin Asn Leu Thr Gin Ser Leu Ser Gin 

455 460 465 

Lys Glu Asn Arg Glu Sor Pro Lys Gin Asn Ser Oly Lys Olu Gin 

470 475 480 

Asn Asp Asn Thr Ser Asn Oly Thr Asn Asp Tyr He Gly Ser Val 

485 490 495 

Glu Lys Trp Arg 



<210> 7 
<211> 222 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> miscjeat\nre 

<223> mcyte ID Wo: 74803 67C3)1 



<400> 7 

Met Ala Glu Lys Pro Lys Leu His Tyr Ser Asn Ala Arg Gly Ser 
1 5 10 15 

Met Glu Ser He Arg Trp Leu Leu Ala Ala Ala Gly Val Glu Leu 
20 25 30 

Glu Glu Lys Phe Leu Glu Ser Ala Glu Asp Leu Asp Lys Leu Arg 
35 40 45 

Asn Asp Gly Ser Lou Leu Phe Gin Gin Val Pro Met Val Glu He 
50 55 60 

Asp Gly Met Lys Leu Val Gin Thr Arg Ala He Leu Asn Tyr He 
65 70 75 

Ala Ser Lys Tyr Asn Leu Tyr Oly Lys Asp Mot Lys Glu Arg Ala 
80 85 90 

Leu He Asp Met a*yr Thr Glu Gly He Val Asp Leu Thr Glu Met 
95 100 105 

H© Leu Lou Leu Leu He Cys Gin Pro Glu Olu Arg Asp Ala Lys ' 

110 115 120 

Thr Ala Leu Val Lys Glu Lys He Lys Asn Arg Tyr Phe Pro Ala 
125 130 135 

Phe Glu Lys Val Leu Lys Sor His Arg Gin Asp Tyr Leu Val Gly 
140 145 150 

Asn Lys Leu Ser Trp Ala Asp He His Leu Val Glu Leu Phe Tyr 
155 160 165 

Tyr Val Glu Glu Leu Asp Ser Ser Leu He Ser Ser Phe I^o Leu 
170 175 180 

Leu Lys Ala Leu Lys Thr Arg He Ser Asn Leu Pro Thr Val Lys 
185 190 195 

Lys Phe Leu Gin Pro Gly Ser Gin Arg Lys Pro Pro Met Asp Glu 
200 205 210 

Lys Ser Leu Glu Glu Ala Arg Lys He Phe Arg Phe 
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2X5 220 

<210> 8 

<211> 330 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> IiiLcyte ID No: 8069390C©1 

<400> 8 

Met Ala Ala Val Asp Ser Phe Tyr Leu Leu Tyr Arg Glu lie Ala 
15 10 15 

Arg Ser Cys Asn Cys Tyr Met Glu Ala Leu Ala Leu Val Gly^Ala 
20 25 30 

Tip Tyr Thr Ala Arg Lys Ser lie Thr Val lie Cys Asp PliG Tyr 
35 40 45 

Ser Leu II© Arg Leu Hi3 Vhe lie Pro Arg Leu Gly Ser Arg Ala 
50 55 60 

Asp Leu lie Lys Gin Tyr Gly Arg Trp Ala Val Val Ser Gly Ala 
65 70 75 

Hit Asp Gly lie Gly Ly3 Ala Tyr Ala Glu Glu Leu Ala Ser Arg 
80 85 90 

Gly Leu Asn He Tie Leu He Ser Arg Asn Glu Glu Lys Leu Gin 
95 100 105 

Val Val Ala Lys Asp He Ala Asp Thr Tyr Lys Val Glu Thr Asp 

110 115 120 

lie He Val Ala Asp Phe Ser Ser Gly Arg Glu He Tyr Leu Pro 

135 130 135 

He Arg Glu Ala Leu Lys Asp Lys Asp Val Gly He Leu Val Asn 

140 145 150 

Asn Val Gly Val Efhe Tyr Pro Tyr Pro Gin a*yr Phe Thr Gin Leu 

155 16a 165 

Ser Glu Asp Lys Leu T3cp Asp He He Abb Val Asn He Ala Ala 

170 175 180 

Ala Ser Leu Met Val His Val Val Leu Pro Gly Met Val Glu Arg 

185 190 195 

Lys Lys Gly Ala He Val Thr He Ser Ser Gly Ser Cys Cys Lys 

200 205 210 

Pro Thr Pro Gin Leu Ala Ala Phe Ser Ala Ser Lys Ala Tyr Leu 

215 220 225 

Asp His Phe Ser Arg Ala Leu Gin Tyr Glu Tyr Ala Ser Lys Gly 

230 235 240 

He Phe Val Gin Ser Leu He Pro Phe Tyr Val Ala Thr Ser Met 

245 250 255 

Thr Ala Pro Ser Asn Phe Leu His Arg Cyp Ser Trp Leu Val Pro 

260 265 270 

Ser Pro Lys Val Tyr Ala His His Ala Val Ser Thr Leu Gly He 

275 280 285 

Ser Lys Arg Thr Thr Gly 1*yr Trp Ser His Ser He Gin Phe Leu 

290 295 300 

Phe Ala Gin Tyr Met Pro Glu Trp Leu Trp Val Txp Gly Ala Asn 

305 310 315 

He Leu Asn Arg Ser Leu Arg Lys Glu Ala Leu Ser cys Thr Ala 

320 325 330 



<210> 9 

<211> 303 

<212> PRT 

<213> HazBo se^iens 
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<220> 

<221> inisc_fGatur© 

<223> Incyte ID No: 7473869CD1 



<400> 9 



Met 


Tyr 


Val 


Ser 


Thr 


Arg Glu Arg Tyr Lys Trp Leu Arg 


Phe Ser 


1 








5 


10 


15 


Glu 


Asp 


cys 


Leu 


Tyr 


Leu Asn Val Tyr Ala Pro Ala Arg 


Ala Pro 










20 


25 


30 


Gly 


Asp 


Pro 


Gin 


Leu 


Pro Val Met Val Trp Phe Pro Gly 


Gly Ala 










35 


40 


45 


Phe 


lie 


Val 


Gly 


Ala 


Ala . Ser Ser Tyr Glu Gly Ser Asp 


Leu Ala 










50 


55 


60 


Ala 


Arg 


Glu 


Lys 


Val 


Val Leu Val Phe Leu Gin His Arg 


Leu Gly 










65 


70 


75 


lie 


Phe 


Gly 


Phe 


Leu 


Ser Thr Asp Asp Ser His Ala Arg 


Gly Asn 










80 


85 


90 


Tzp 


Gly 


Leu 


Leu 


Asp 


Gin Met Ala Ala Leu Arg Trp Val 


Gin Glu 










95 


100 


105 


Asn 


He 


Ala 


Ala 


Phe 


Gly Gly Asp Pro Gly Asn Val 'ffbx 


Leu Phe 










110 


115 


120 


Gly 


Gin 


Ser 


Ala 


Gly 


Ala Met Ser He Ser Gly Leu Met 


Met Ser 










125 


130 


135 


Pro 


Leu 


Ala 


Ser 


Gly 


Leu Phe His Arg Ala He Ser Gin 


Ser Gly 










140 


145 


150 


Thr 


Ala 


Leu 


Phe 


Arg 


Leu Phe He Thr Ser Asn Pro Leu 


Lys Val 










155 


160 


165 


Ala 


Lys 


Lys 


Val 


Ala 


His Leu Ala Gly cys Asn His Asn 


Ser Thr 










170 


175 


180 


Gin 


He 


Leu 


Val 


Asn 


Cys Leu Arg Ala Leu Ser Gly Thr 


Lys Val 










185 


190 


195 


Met 


Arg 


Val 


Ser 


Asn 


Lys Met Arg Phe Leu Gin Leu Asn 


Phe Gin 










200 


205 


210 


Arg 


Asp 


Pro 


Glu 


Glu 


He He Trp Ser Met Ser Pro Val 


Val Asp 










215 


220 


225 


Gly 


Val 


Val 


He 


Pro 


Asp Asp Pro Leu Val Leu Leu Thr 


Gin Gly 










230 


235 


240 


Lys 


Val 


Ser 


Ser 


Val 


Pro Tyr Leu Leu Gly Val Asn Asn 


Leu Glu 










245 


250 


255 


Phe 


Asn 


Trp 


Leu 


Leu 


Pro Tyr He Met Lys Phe Pro Lou 


Asn Arg 










260 


265 


270 


GllL 


Ala 


Met 


Arg 


Lys 


Glu Thr He Thr Lys Met Leu Trp 


Ser Thr 










275 


280 


285 


Arg 


Thr 


Leu 


Leu 


Val 


Arg Asp Pro Ala Gly Arg Gly Ala 


Gin Phe 










290 


295 


300 


Gly 


Gin 


Gly 











<210> 10 

<211> 584 

<212> PRT 

<213> Hosno salens 

<220> 

<221> iiLLsc_feature 

<223> Incyte ID No: 7478588CD1 

<400> 10 

Met Pro Ser Thr Val Leu Pro Ser Thr Val Leu Pro Ser Leu Leu 
15 10 15 

Pro Thr Ala Gly Ala Gly Trp Ser Met Arg Trp He Leu Cys Trp 
20 25 30 
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Ser Leu Thr Leu Cys Leu Met Ala Gin Thr Ala Leu Gly Ala Leu 
35 40 45 

His Olir Lys Arg Pro Gin Val Val TSir Lys Tyr Gly Thr Leu Gin 
50 55 60 

Gly Lys Qln Met His Val Gly Lys ar Pro lie Gin Val Phe Leu 
65 70 75 

Gly Val Pro Phe Ser Arg Pro Pro Leu Gly He Leu Arg Phe Ala 
80 85 90 

Pro Pro Glu Pro Pro Glu Pro Trp Lys Gly He Arg Asp Ala Tlir 
95 100 105 

Uir Tyr Pro Pro Gly Cys Leu Gin Glu Ser Trp Gly Gin Leu Ala 

110 • 115 120 

Ser Met Tyr Val Ser Tlir Arg Glu Arg Tyr Lys Trp Leu Arg Phe 

125 130 135 

Ser Glu Asp Cys Leu Tyr Leu Asn Val Tyr Ala Pro Ala Arg Ala 

140 145 150 

Pro Gly Asp Pro Gin Leu Pro Val Met Val Trp Phe Pro Gly Qly 

155 160 165 

Ala Phe He Val Gly Ala Ala Ser Ser Tyr Glu Gly Ser Asp Leu 

170 175 180 

Ala Ala Arg Glu Lys Val Val Leu Val Phe Leu Gin His Arg Leu 

185 190 195 

Gly He Phe Gly Phe Leu Ser Thr Asp Asp Ser His Ala Arg Gly 

200 205 210 

Asn Trp Gly Leu Leu Asp Gin Met Ala Ala Leu Arg Trp Val Gin 

215 220 225 

Glu Asn He Ala Ala Phe Gly Gly Asp Pro Gly Asn Val Olir Leu 

230 235 240 

Phe Gly Gin Ser Ala Qly Ala Met Ser He Ser Qly Leu Met Met 

245 250 255 

Ser Pro Leu Ala Ser Gly Leu Phe His Arg Ala He Ser Qln Ser 

260 265 270 

Gly Thr Ala Leu Phe Arg Leu Phe He Thr Ser Asn Pro Leu Lys 

275 280 285 

Val Ala Lys Lys Val Ala His Leu Ala Qly Cys Asn His Asn Ser 

290 295 300 

Thr Gin He Leu Val Asn Cys Leu Arg Ala Leu Ser Qly Thr Lys 

305 310 315 

Val Met Arg Val Ser Asn Lys Met hrg Phe Leu Qln Leu Asn Phe 

320 325 330 

Gin Arg Asp Pro Glu Glu He He Trp Ser Met Ser Pro Ved Val 

335 340 345 

Asp Gly Val Val He Pro Asp Asp Pro Leu Val Leu Leu Hbr Gin 

350 355 360 

Gly Lys Val Ser Ser Val Pro Tyr Leu Leu Qly Val Asn Asn Leu 

365 370 375 

Glu Phe Asn Trp Leu Leu Pro Tyr He Met Lys Phe Pro Leu Asn 

380 385 390 

Arg Gin Ala Met Arg Lys Glu Thr He Thr Lys Met Leu Trp Ser 

395 400 405 

Thr Arg Thr Leu Leu Asn He Thr Lys Glu Gin Val Pro Leu Val 

410 415 420 

Val Glu Glu Tyr Leu Asp Asn Val Asn Glu His Asp Trp Lys Met 

425 430 435 

Leu Arg Asn Arg Met Met Asp He Val Gin Asp Ala Thr Phe Val 

440 445 450 

Tyr Ala Thr Leu Gin Thr Ala His Tyr His Arg Asp Ala Gly Leu 

455 460 465 

Pro Val Tyr Leu Tyr Glu Phe Glu His His Ala Arg Gly He He 

470 475 480 

Val Lys Pro Arg Thr Asp Gly Ala Asp His Gly Asp Glu Met Tyr 

485 490 495 
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Phe 


Leu 


Phe Gly 


Glv 


Pro Phe Ala. 








500 




Qlu 


Lys 


Ala Leu 


Ser 


Leu Gin Met 








515 




Ala 


Arg 


Thr Gly 


Asn 


Pro Asn Asp 








530 




Arg 


Tyr 


Asn Lys 


Asp 


Glu Lys Tyr 








545 




Arg 


Val 


Gly Met 


Lys 


Leu Lys Glu 








560 




Ser 


Lea 


Tyr Gin 


Sei; 


Gin Arg Pro 








575 





<210> 11 

<211> 508 

<212> PRT 

<213> Homo sapieas 

<220> 

<221> misc^feature 

<223> Incyte ID No: 55046125CD1 

<400> 11 



Met 


His 


Val 


Leu 


Arg 


Arg 


Arg 


Trp 










D 








Ala 


Leu 


Leu 


Thr 


Arg 


Gly 


Leu 


Ala 


His 


Val 


Leu 


Glv 


Ala 
35 


He 


Phe 


Ser 


Ser 


Val 


Gly 


Asn 


Met 
50 


Asp 


Glu 


Lys 


Leu 


Val 


Met 


Leu 


Phe 
65 


Lou 


Phe 


Ala 


Cya 


Pro 


Gly 


Thr 


Glu 
80 


Cys 


Gin 


Leu 


Ser 


Pro 


Val 


Pro 


Asp 
95 


Pro 


Tyr 


Arg 


Arg 


Phe 


Val 


Pro 


Arg 
110 


Tyr 


Asn 


Phe 


Lys 


Val 


Asp 


Phe 


Asp 
125 


He 


Lys 


Gly 


His 


lie 


Gin 


Lys 


Thr 
140 


Gly 


Gly 


Thr 


Arg 


Asn 


He 


Gin 


Leu 
155 


Glu 


Gin 


Pro 


Lys 


Lys 


Cys 


Thr 


cys 
170 


His 


Arg 


Pro 


Phe 


Ser 


Arg 


Phe 


Ser 
185 


Thr 


Gly 


Trp 


Trp 


Thr 


Glu 


Leu 


Thr 
200 


Ser 


cys 


Val 


Arg 


Asp 


Ala 


Arg 


Leu 
215 


Arg 


Pro 


Ser 


lie 


Leu 


Arg 


Asp 


Pro 
230 


Val 


Ser 


Arg 


Val 


Glii 


Arg 


Gly 


Ala 
245 


Thr 


Trp 


Lys 


Gly 


Arg 


Pro 


Pro 


Thr 
260 


Ser 


Glu 


Glu 


Asp 


Asp 


Trp 


Ser 


Gly 
275 


cys 


Pro 


Leu 


Tyr 


Asn 


Leu 


Ala 


Asn 


Asn 


Arg 


Gin 



Thr 


Gly 


Leu Ser 


Met 


Gly 


Lys 




505 








510 


Met 


Lys 


Tyr Txp 


Ala 


Asn 


Phe 




520 








525 


Gly 


Asn 


Leu Pro 


Cys 


Tip 


Pro 




535 








540 


Leu 


Gin 


Leu Asp 


Phe 


Thr 


Thr 




550 








555 


Lys 


Lys 


Met Ala 


Phe 


Trp 


Met 




565 








570 


Glu 


Lys 


Gin Arg 


Gin 


Phe 






580 










Asp 


Leu 


vjj,y o fcix 


Leu 


cys 


Arg 




10 








15 




Leu 






Lsu 


Lys 




25 








30 


Jjys 






It ro 








40 








45 




Asn 




Leu 


Leu 


AT n 




55 








60 






T exit 




Tyr 


VclX 




70 








75 


eu 


Arg 




Ala 


Phe 


Ser 




85 








90 












Ala 




100 








105 


-LUUu 


Al^ 






Leu 


Arg 




115 








120 


Asp 


Asp 


- 

Leu XXe 


VclI 


Phe 


Leu 




130 








135 


Thr 


Phe 




His 


Leu 


Val 




145 








150 


v-ys» 


Glu 




Val 


Glv 


Gin 




160 








165 


Glv 


Lys 


Arg Qlu 


Thr 




L6U 




175 








180 


Ser 


cys 


Gly Leu 


His 


Ala 


Asp 




190 








195 


Pro 


Ser 


Val Val 


Asp 


Glv 


Lvs 




205 








210 


Arg 


Asn 


Phe His 


Tyr 


He 


Tbr 




220- 








225 


Tyr 


Leu 


Ser Qlu 


Trp 


Arg 


His 




235 








240 


Ala 


Ser 


Leu His 


Val 


cys 


Asp 




250 








255 


Leu 


Pro 


Ser Cys 


Tyr 


Thr 


Gly 




265 








270 


Lys 


Glu 


Phe Met 


Asp 


cys 


Pro 




280 








285 


Val 


Arg 


Met Leu 


Ser 


Asp 


Leu 
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290 






Leu 


Val 


Glv Cvs 


Tyr Asn lieu 








305 




Aircr 


Ash 


Lvs 


Val Leu 


Leu Glu Ser 








320 




Met 


Ala 


Phe 


Phe Gly 


Leu Thr Glu 








335 




XfCu 


Phe 


Glu 


Lys Thr 


Phe Asn Met 








350 




Gin 




Asn 


Thr Thr 


fUb^ fx«LGL »JOJ. 








365 




lie 


Gin 


ys 


Ar^ lis 


RTii Rlv T.sni 








380 










Ala Lys 


AB»^ JjoUL JcJLlo 








395 








Lys 


Rill Hi Q 


RIti 








410 






Lys 






fSTv Zlvrff T>AH 






425 




Gin 


Gly 


Gin 


Gly Gin 


Ser Gin Asn 








440 




Asm 


Pro 


Asn 


Pro Aen 


Ala Asn Gin 








455 




Gin 


Asn 


Leu 


Hir Gin 


Ser Leu Ser 








470 




Pro 


Lys 


Gin 


Asn Ser 


Gly Lys Glu 








485 




Gly 


Thr 


Asn 


Asp Tyr 


He Gly Ser 








500 





<210> 12 

<211> 439 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc^foature 

<223> Incyte IB No: 3538709CD1 

<400> 12 



Met 


Leu 




Gly 


Val 


Thr 


Asp 


Gly 


1 








5 








Thr 


Pro 


Pro 


Asn 


Ala 
20 


Val 


Gly 


Pro 


Asp 


Gly 


Tyr 


Thr 


Phe 

35 


Val 


Glu 


Val 


Val 


Lys 


His 


Ala 


Gly 
50 


Pro 


Ala 


Pro 


Ser 


Ser 


Ala 


Ser 


Ser 
65 


Asp 


Ala 


Ala 


Val 


Arg 


Cys 


Gin 


Arg 
80 


Arg 


He 


Thr 


Leu 


Val 


Glu 


Asn 


Leu 
95 


Gly 


Arg 


Ala 


Gly 


Gin 


Asn 


Gly 


Ser 
110 


Gly 


Glu 


Pro 


Leu 


Ala 


Asp 


Pro 


Ala 
125 


Gly 


Ser 


Asp 


Ala 


Gly 


Ser 


Gly 


Ser 
140 


Gly 


Ser 


Gly 


Ala 


Arg 


Arg 


Pro 


Lya 
155 


Arg 


Thr 


He 



295 300 



Ser Val 


Met 


Pro 


Glu 


Lys 


Gin 


310 








315 


Ala Lys 


Ser 


Asn 


Leu 


Lys 


His 


325 










330 


Phe Gin 


Ara 


Lvs 


Thr 


Gin 


Tvr 


340 










345 


Asn Phe 


He 


Ser 


Pro 


Phe 


Thr- 


355 










360 


Ser Val 


Glu 


He 


Asn 


Glu 


Glu 


370 










375 


Asn Phe 


Leu 


Asp 


Met 


Glu 


Leu 


385 










390 


T.«ii Glin 


Arg 




Gin 


Ph.e 


Met 


400 










405 




Lys 


Arg 


Gin 


Glu 


Gin 


415 










420 


Leu Gin 


Thr 


His 


Pile 


Gin 


Ser 


430 










435 


Pro Asn 


Gin 


Asn 


Gin 


Ser 


Gin 


445 










450 


Asn Leu 


Thr 


Gin 


Asn 


Leu 


Met 


460 










465 


Gin Lys 


Glu 


Asn 


Arg 


Glu 


Ser 


475 










480 


Gin Aan 


Asp 


Asn 


Thr 


Ser 


Asn 


490 










495 


Val Glu 


Lys 


Trp 


Arg 






505 




< 








lie Phe 


Cvs 


Cvs 


Leu 


Leu 


Glv 


10 










15 


Leu Glu 


Ser 


Val 


Glu 


Ser 


Ser 


25 










30 


Lys Pro 


Gly 


Arg 


Val 


Leu 


Arcr 


40 










45 


Ala Ala 


Ala 


Pro 


Pro 


Pro 


Pro 


55 










60 


Gin Gly 


Asp 


Leu 


Ser 


Gly 


Leu 


70 










75 


Val Tyr 


Arg 


Asn 


Gly 


Arg 


Leu 


85 










90 


Pro Arg 


Ala 


Asp 


Leu 


Leu 


His 


100 










105 


Pro Ala 


Ala 


Leu 


Glu 


Val 


Glu 


115 










120 


Gly Arg 


Leu 


Ala 


Pro 


Gly 


Ser 


130 










135 


Ser Gly 


Qly 


Arg 


Arg 


Arg 


Arg 


145 










150 


His He 


Asp 


Cys 


Glu 


i*ys 


Arg 



160 165 
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lie 


Thr 


Ser 


Cys 


Lys 


Gly 


Ala 


Gin Ala 


Asp 


Val Val Leu Phe 


Phe 










170 








175 




180 


lie 


His 


Gly 


Val 


Gly 


Gly 


Ser 


Leu Ala 


He 


Trp Lys Glu Gin 


Leu 










185 








190 




195 


Asp 


Phe 


Phe 


Val 


Arg 


Leu 


Gly 


Tyr Qlu 


Val 


Val Ala Pro Asp 


Leu 










200 








205 




210 


Ala 


Gly 


His 


Gly 


Ala 


Ser 


Ser 


Ala Pro 


Gin 


Val Ala Ala Ala 


Tyr 










215 








220 




225 


Thr 


Phe 


Tyr 


Ala 


Leu 


Ala 


Glu 


Asp Met 


Arg 


Ala He Phe Lys 


Arg 










230 








235 




240 


T^rr 


Ala 


Lys 


Lys 


Arg 


Asn 


Val 


Leu He 


Gly 


His Ser Tyr Gly 


Val 










245 








250 




255 


Ser 


Phe 


Cys 


Thr 


Phe 


Leu 


Ala 


His Glu 


Tyr 


Pro Asp Leu Val 


His 










260 








265 




270 


Lys 


Val 


lie 


Met 


He 


Asn 


Gly 


Oly Qly 


Pro 


Thr Ala Leu Qlu 


Pro 










275 








280 




285 


Ser 


Phe 


Cys 


Ser 


He 


Phe 


Asn 


Met Pro 


Thr 


Cys Val Leu His 


Cys 










290 








295 




300 


Leu 


Ser 


Pro 


Cys 


Leu 


Ala 


Trp 


Ser Phe 


Leu 


Lys Ala Gly Phe 


Ala 










305 








310 




315 


Arg 


Gin 


Gly 


Ala 


Lys 


Glu 


Lys 


Oln Leu 


Leu 


Lys Glu Gly Asn 


Ala 










320 








325 




330 


Fhe 


Asn 


Val 


Ser 


Ser 


Phe 


Val 


Leu Arg 


Ala 


Met Met Ser Gly 


Gin 










335 








340 




345 


Tyr 


Trp 


Pro 


Glu 


Gly 


Asp 


Glu 


Val Tyr 


His 


Ala Glu Leu Thr 


Val 










350 








355 




360 


Pro 


Val 


Leu 


Leu 


Val 


His 


Qly 


Met His 


Asp 


Lys Phe Val Pro 


Val 










365 








370 




375 


GlU 


Glu 


Asp 


Gin 


Arg 


Met 


Ala 


Glu He 


Leu 


Leu Leu Ala Phe 


Leu 










380 








385 




390 


Lys 


Leu 


He 


Asp 


Glu 


Gly 


Ser 


His Met 


Val 


Met Leu Glu Cys 


Pro 










395 








400 




405 


Glu 


'Thr 


Val 


Asn 


Thr 


Leu 


Leu 


His Glu 


Phe 


Leu Leu Trp Glu 


Pro 










410 








415 




420 


Qlu 


Pro 


Ser 


Pro 


Lys 


Ala 


Leu 


Pro Glu 


Pro 


Leu Pro Ala Pro 


Pro 



425 430 435 

Glu Asp Lys Lys 



<210> 13 

<211> 514 

<212> PRT 

<213> HoiDO sapiens 

<220> 

<221> misc_£eature 

<223> Incyte ID No: 71563101CD1 

<400> 13 



Met 


Thr 


Leu 


He 


Trp 


Arg 


His 


Leu Leu 


Arg 


Pro Leu cys Leu 


Val 


1 








5 








10 




15 


Thr 


Ser 


Ala 


Pro 


Arg 


He 


Leu 


Glu Met 


His 


Pro Phe Leu Ser 


Leu 










20 








25 




30 


Gly 


Thr 


Ser 


Arg 


Thr 


Ser 


Val 


Thr Lys 


Leu 


Ser Leu His Thr 


Lys 










35 








40 




45 


Pro 


Arg 


Met 


Pro 


Pro 


Cys 


Asp 


Phe Met 


Pro 


Glu Arg Tyr Gin 


Ser 










50 








55 




60 


Leu 


Qly 


Tyr 


Asn 


Arg 


Val 


Leu 


Glu He 


His 


Lys Glu His Leu 


Ser 










65 








70 




75 


Pro 


Val 


Val 


Thr 


Ala 


Tyr 


Phe 


Gin Lys 


Pro 


Leu Leu Leu His 


Gin 










80 








85 




90 


Gly 


His 


Met 


Glu 


Trp 


Leu 


Phe 


Asp Ala 


Glu 


Gly Asn Arg Tyr 


Leu 
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95 100 105 

Asp Phe Phe Ser Qly lie Val Hir Val Ser Val Gly His Cys His 

110 115 120 

Pro Lys Val Asn Ala Val Ala Gin Lys Gin Leu Gly Arg Leu Trp 

125 130 135 

His Thr Ser Thr Val Phe Phe His Pro Pro Met His Glu Tyr Ala 

140 145 150 

Glu Lys Leu Ala Ala Leu Leu Pro Glu Pro Leu Lys Val lie Phe 

155 160 165 

Leu Val Asn Ser Gly Ser Glu Ala Asn Glu Leu Ala Met Leu Met 

170 175 180 

Ala Arg Ala His Ser Asn Asn lie Asp lie lie Ser Phe Arg Gly 

185 190 195 

Ala Tyr His Gly Cys Ser Pro Tyr Thr Leu Qly Leu Thr Asn Val 

200 205 210 

Gly II© Tyr Lys Met Qlu Leu Pro Gly Gly Thr Gly Cys Gin Pro 

215 220 225 

Thr Met Cys Pro Asp Val Phe Arg Gly Pro Trp Gly Gly Ser His 

230 235 240 

cys Arg Asp Ser Pro Val Gin Thr He Arg Lys Cys Ser Cys Ala 

245 250 255 

Pro Asp Cys Cys Gin Ala Lys Asp Gin Tyr He Glu Gin Phe Lys 

260 265 270 

Asp Thr Leu Ser Thr Ser Val Ala Lys Ser He Ala Gly Phe Phe 

275 280 2B5 

Ala Glu Pro He Gin Gly Val Asn Gly Val Val Gin Tyr Pro Lys 

290 295 300 

Gly Phe Leu Lys Glu Ala Phe Glu Leu Val Arg Ala Arg Gly Gly 

305 310 315 

Val cys He Ala Asp Glu Val Gin Thr Gly Phe Gly Arg Leu Gly 

320 325 330 

Ser His Phe Trp Gly Phe Gin Thr His Asp Val Leu Pro Asp He 

335 340 345 

Val Thr Met Ala Lys Gly He Gly Asn Gly Phe Pro Met Ala Ala 

350 355 360 

Val He Thr Thr Pro Glu He Ala Lys Ser Leu Ala Lys CyB Leu 

365 370 375 

Gin His Phe Asn Thr Phe Gly Gly Asn Pro Met Ala Cys Ala He 

380 385 390 

Qly Ser Ala Val Leu Glu Val He Lys Glu Glu Asn Leu Gin Qlu 

395 400 405 

Asn Ser Gin Glu Val Gly Thr Tyr Met Leu Leu Lys Phe Ala Lys 

410 415 420 

Leu Arg Asp Glu Phe Qlu He Val Gly Asp Val Arg Qly Lys Gly 

425 430 435 

Leu Met He Gly He Glu Met Val Gin Asp Lys He Ser Cys Arg 

440 445 450 

Pro Leu Pro Arg Glu Qlu Val Asn Qln He His Glu Asp Cys Lys 

455 460 465 

His Met Gly Leu Leu Val Gly Arg Gly Ser He Phe Ser Gin Thr 

470 475 480 

Phe Arg He Ala Pro Ser Met Cys He Thr Lys Pro Qlu Val Asp 

485 490 495 

Phe Ala Val Glu V&l Phe Arg Ser Ala Leu Thr Gin His Met Glu 

500 505 510 

Arg Arg Ala Lys 



<210> 14 

<211> 226 

<212> PRT 

<213> HooxtD sapiens 
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<220> 

<221> misc_f eature 

<223> Incyte ID No: 7472027CD1 

<400> 14 

Met Arg Leu Cyis Glu Lys Thr Glu Leu Gin Leu He Gly Val Pro 
15 10 15 

Glu Ser Asp Arg Glu Asn Gly 'Sbr Lys Leu Glu Asn Thr Phe Gin 
20 25 30 

Asp He He Gin Glu Asn Phe Pro Asn Leu Ala Arg Gin Ala Asn 
35 40 45 

He Gin He Gin Met Ala Qly Gly Ser He Trp.Ile Glu Gly He 
50 55 60 

Pro Plie Pro Ser Asn Asn Phe Tfbr Asp Leu Arg Arg Leu Gin Asp 
65 70 75 

Glu He Val Leu Arg Asp Glu Asp Val He Thr Leu Ser Tyr Pro 
80 85 90 

Lys Ser Gly Ser Phe Trp He Val Glu He He Ser Leu He His 
95 100 105 

Ser Lys Gly Asp Pro Ser Trp Val Gin Ser Val Val Pro Trp Asp 
110 115 120 

Arg Ser Pro Trp He Glu Val Lys Arg Lys Lys Ala Gly Leu Glu 
125 130 135 

Ser Gin Lys Gly Pro His Leu oyr Thr Ser His Leu Pro He Gin 
140 145 150 

Leu Phe Pro Lys Ser Phe Lou Asn Ser Lys Ala Lys Cys He Tyr 
155 160 165 

Pro His Val Leu Met Leu Val Val Leu He Leu Gly His Lys Ser 
170 175 180 

Gin Trp Ser He Ala He Lys He Ser Glu Asn Ala Glu Ala 'Sfhr 
185 190 195 

Ser Lys Leu Gly Asn Gly Gin Arg Leu Glu Glu Phe Gly Gly Leu 
200 205 210 

Arg Arg Arg Gin Glu Asp Glu Arg Ser Leu Glu Phe Leu Arg Asp 
215 220 225 

cys 



<210> 15 
<211> 121 
<212> PRT 

<213> Hdno sapiens 
<220> 

<221> misc^feature 

<223> Incyte ID No: 74B035BCD1 

<400> 15 

Met Met Lys Val Met Tyr Met Leu Lys Gly Gin Ser Pro Val Gin 
15 10 15 

Gly Thr He His Phe Glu Gin Lys Glu Asn Glu Pro Phe Met Val 

20 25 30 

Ser Glu Cys He Thr Gly Leu Thr Glu Arg Gin His Arg Phe His 

35 40 45 

Val His Gin Phe Gly Asp Asn Thr Pro Gly Cys Thr Arg Ala Val 

50 55 60 

Pro Tyr Phe Asn Pro Leu Thr Lys Asn His Ser Gly Pro Arg He 

65 70 75 

Lys Arg Gly Arg Leu Glu Thr Trp Val Met Trp Pro Leu Ala Lys 

80 85 90 

Met Cys Arg His Met Ser Val Glu Asp Ser Leu Val Ser Leu Ser 

95 100 105 
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Gly His Tyr Ser lie Thr Ala His Thr Met Val Ser Met Thr Thr 
110 115 120 

Arg 



<210> 16 

<211> 486 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_featvire 

<223> Izicyte ID Ko; 1618256CD1 

<400> 16 



Met 


Gly 


Pro 


Leu 


Ser 


Pro 


Ala 


Arg 


Thr Leu 


Arg Leu 


Tip 


Gly 


Pro 


1 








5 








10 




15 


Arg 


Ser 


Leu 


Gly 


Val 
20 


Ala 


Leu 


Gly 


Val Phe 
25 


Met Thr 


lie 


Gly 


Phe 
30 


Ala 


Leu 


Gin 


Leu 


Leu 
35 


Oly 


Gly 


Pro 


Phe Gin 
40 


Arg Arg 


Leu 


Pro 


Gly 
45 


Leu 


Gin 


Leu 


Arg 


Gin 
50 


Pro 


Ser 


Ala 


Pro Ser 
55 


Leu Arg 


Pro 


Ala 


Leu 
60 


Pro 


Ser 


Cys 


Pro 


Pro 
65 


Arg 


Qln 


Arg 


Leu Val 
70 


Phe Leu 


Lys 


Thr 


His 
75 


Lys 


Ser 


Gly 


Ser 


Ser 
80 


Ser 


Val 


Leu 


Ser Leu 
85 


Leu His 


Arg 


Tyr 


Gly 
90 


Asp 


Qln 


His 


Gly 


Leu 
95 


Arg 


Phe 


Ala 


Leu Pro 
100 


Ala Arg 


Tyr 


Gin 


Phe 

105 


Gly 


Tyr 


Pro 


Lys 


Leu 
110 


Phe 


Gin 


Ala 


Ser Arg 
115 


Val Lys 


Gly 


Tyr 


Arg 
120 


Pro 


Gin 


Gly 


Gly 


Gly 
125 




Gin 


Leu 


Pro Phe 

130 


His lie 


Leu 


Cys 


His 
135 


His 


Met 


Arg 


Phe 


Asn 
140 


Leu 


Lya 


Glu 


Val Leu 
145 


Gin Val 


Met 


Pro 


Ser 
150 


Asp 


Ser 


Phe 


Phe 


Phe 
155 


Ser 


He 


Val 


Arg Asp 
160 


Pro Ala 


Ala 


Leu 


Ala 
165 


Arg 


Ser 


Ala 


Phe 


Ser 
170 


Tyr 


Tyr 


Lys 


Ser Thr 
175 


Ser Ser 


Ala 


Phe 


Arg 
180 


Lys 


Ser 


Pro 


Ser 


Leu 
185 


Ala 


Ala 


Phe 


Leu Ala 
190 


Asn Pro 


Arg 


Gly 


Phe 
195 


Tyr 


Arg 


Pro 


Gly 


Ala 
200 


Arg 


Gly 


Asp 


His Tyr 
205 


Ala Arg 


Asn 


Leu 


Leu 
210 


Trp 


Phe 


Asp 


Phe 


Gly 
215 


Leu 


Pro 


Phe 


Pro Pro 
220 


Glu Lys 


Arg 


Ala 


Lys 
225 


Arg 


Gly 


Asn 


lie 


His 
230 


Pro 


Pro 


Arg 


Asp Pro 
235 


Asn Pro 


Pro 


Gin 


Leu 
240 


Gin 


Val 


Leu 


Pro 


Ser 
245 


Gly 


Ala 


Gly 


Pro Arg 
250 


Ala Gin 


Thr 


Leu 


Asn 
255 


Pro 


Asn 


Ala 


Leu 


Tie 
260 


His 


Pro 


Val 


Ser Thr 
265 


Val Thr 


Asp 


His 


Arg 
270 


Ser 


Gin 


lie 


Ser 


Ser 
275 


Pro 


Ala 


Ser 


Phe Asp 

280 


Leu Gly 


Ser 


Ser 


Ser 
285 


Phe 


lie 


Gin 


Tzp 


Gly 
290 


Leu 


Ala 


Trp 


Leu Asp 
295 


Ser Val 


Phe 


Asp 


Leu 
300 


Val 


Met 


Val 


Ala 


Glu 
305 


Tyr 


Phe 


Asp 


Glu Ser 
310 


Leu Val 


Leu 


Leu 


Ala 
315 


Asp 


Ala 


Leu 


cys 


Trp 
320 


Gly 


Leu 


Asp 


Asp Val 
325 


Val Gly 


Phe 


Met 


His 
330 


Asn 


Ala 


Gin 


Ala 


Gly 

335 


His 


Lys 


Gin 


Gly Leu 
340 


Ser Thr 


Val 


Ser 


Asn 
345 


Ser 


Gly 


Leu 


Thr 


Ala 


Glu 


Asp 


Arg 


Gin Leu 


Thr Ala 


Arg 


Ala 


Arg 
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350 


355 




360 


Ala 


Ttp 


Asn 


Asn Leu 


Asp Trp Ala Leu Tyr 


Val His Phe Asn 


Arg 








365 


370 




375 


Ser 


Leu 


Trp 


Ala Arg 


He Qlu Lys Tyr Gly 


Gin Gly Arg Leu 


Gin 








380 


385 




390 


Thr 


Ala 


val 


Ala Glu 


Leu Arg Ala Arg Arg 


Glu Ala Leu Ala 










395 


400 




405 


His 


Cys 


Leu 


Val Gly 


Gly Glu Ala Ser Asp 


Pro Lys Tyr He 


Thr 








410 


415 




420 


Asp 


Arg 


Arg 


Phe Arg 


Pro Phe Gin Phe Gly 


Ser Ala Lys Val 


Leu 








425 


430 




435 


Gly 


Tyr 


He 


Leu Arg 


Ser Gly Leu Ser Pro 


Gin Asp Gin Glu 


Glu 








440 


445 




450 


Cys 


Glu 


Arg 


Leu Ala 


Thr Pro Glu Leu Gin 


Tyr Lys Asp Lys 


Leu 








455 


460 




465 


Asp 


VeQ 


Lys 


Gin Phe 


Pro Pro Thr Val Ser 


Leu Pro Leu Lya 


Thr 








470 


475 




480 


Ser 


Arg 


Pro 


Leu Ser 


Pro 












485 









<210> 17 

<211> 649 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> zaisc_f eature 

<223> Incyte ID No: 3387823aE>l 

<400> 17 



Met 


Tyr 


He 


Ser 


Cys 


Leu 


Ser 


Leu Ser Leu 


Phe Phe Leu Ser 


Gly 


i 








5 






10 




15 


Pro 


Leu 


Gin 


Arg 


Val 


Leu 


Glu 


Val Ser Asn 


His Txp Trp Tyr 


Ser 










20 






25 




30 


Met 


Leu 


He 


Leu 


Pro 


Pro 


Leu 


Leu Lys Asp 


Ser Val Ala Ala 


Pro 










35 






40 




45 


Leu 


Leu 


Ser 


Ala 


Ty^ 


Tyr 


Pro 


Asp cys Val 


Gly Met Ser Pro 


Ser 










50 






55 




60 


Cys 


Thr 


Ser 


Thr 


Asn 


Arg 


Ala 


Ala Ala Thr 


Gly Asn Ala Ser 


Pro 










65 






70 




75 


Gly 


hys 


Leu 


Glu 


His 


Ser 


Lys 


Ala Ala Leu 


Ser Val His Val 


Pro 










80 






85 




90 


Gly 


Met 


Asn 


Arg 


Tyr 


Phe 


Gin 


Pro Phe Tyr 


Gin Pro Asn Glu 


Cys 










95 






100 




105 


Gly 


Lys 


Ala 


Leu 


cys 


Val 


Arg 


Pro Asp Val 


Met Glu Leu Asp 


Glu 










110 






115 




120 


Leu 


Tyr 


Glu 


Phe 


Pro 


Glu 


Tyr 


Ser Arg Asp 


Pro Hhr Met Tyr 


Leu 










125 






130 




135 


Ala 


Leu 


Arg 


Asn 


Leu 


He 


Leu 


Ala Leu Trp 


Tyr Thr Asn Cys 


Lys 










140 






145 




150 


Glu 


Ala 


Leu 


Thr 


Pro 


Gin 


Lys 


Cys He Pro 


His He He Val 


Arg 










155 






160 




165 


Gly 


I*eu 


Val 


Arg 


IlG 


Arg 


Cys 


Val Gin Glu 


Val Glu Arg He 


Leu 










170 






175 




180 


Tyr 


Phe 


Met 


Thr 


Arg 


Lys 


Gly 


Leu He Asn 


Thr Gly Val Leu 


Ser 










185 






190 




195 


Val 


Gly 


Ala 


Asp 


Gin 


Tyr 


Leu 


Leu Pro Lys 


Asp Tyr His Asn 


Lys 










200 






205 




210 


Ser 


Val 


He 


He 


He 


Gly 


Ala 


Gly Pro Ala 


Gly Leu Ala Ala 


Ala 










215 






220 




225 


Arg 


Gin 


Leu 


His 


Aen 


Phe 


Gly 


He Lys Val 


Thr Val Leu Glu 


Ala 










230 






235 




240 
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Lys Asp Arg lie Gly Gly Arg Val Trp Asp Asp Lys Ser Phe Lys 

245 250 255 

Gly Val Thr Val Gly Arg Gly Ala Gin lie Val Asn Gly Cys lie 

260 265 270 

Asn Asn Pro Val Ala Leu Met Cys Glu Gin Leu Gly lie Ser Met 

275 280 285 

His Lys Phe Gly Glu Arg Cys Asp Leu lie Gin Glu Gly Gly Arg 

290 295 300 

lie Thr Asp Pro Ihr lie Asp Lys Arg Met Asp Phe His Phs Asn 

305 310 315 

Ala Leu Leu Asp Val Val Ser Glu Trp Arg Lys Asp Lys Thr Gin 

320 325 330 

Leu Gin Asp Val Pro Leu Gly Glu Lys lie Glu Glu He Tyr Lys 

335 340 345 

Ala Phe He Lys Glu Ser Gly He Gin Phe Ser Qlu Leu Glu Gly 

350 355 360 

Gin Val Leu Gin Phe His Leu Ser Asn Leu Glu Tyr Ala Cys Gly 

365 370 375 

Ser Asn Leu His Gin Val Ser Ala Arg Ser Trp Asp His Asn Glu 

380 385 390 

Phe Phe Ala Gin Phe Ala Gly Asp His Thr Leu Leu Thr Pro Gly 

395 400 405 

Tyr Ser Val He He Qlu Lys Leu Ala Glu Oly Leu Asp He Gin 

410 415 420 

Leu Lys Ser Pro Val Gin Cys He Asp Tyr Ser Gly Asp Glu Val 

425 430 435 

Gin Val Thr Thr Thr Asp Gly Thr Gly CPyr Ser Ala Gin Lys Val 

440 445 450 

Lsu Val Thr Val Pro Lea Ala Lew Leu Gin Lys Gly Ala He Gin 

455 460 465 

Phe Asn Pro Pro Leu Ser Glu Lys Lys Met Lys Ala He Asn Ser 

470 475 480 

Leu Gly Ala Gly He He Glu Lys He Ala Leu Gin Phe Pro Tyr 

485 490 495 

Arg Phe Trp Asp Ser Lys Val Gin Gly Ala Asp Phe Phe Gly His 

500 505 510 

Val Pro Pro Ser Ala Ser Lys Arg Oly Leu Phe Ala Val Phe Tyr 

515 520 525 

Asp Met Asp Pro Gin Lys Lys His Ser Val Leu Met Ser VclL He 

530 535 540 

Ala Gly Glu Ala Val Ala Ser Val Arg Thr Leu Asp Asp Lys Gin 

545 550 555 

Val Leu Gin Gin Cys Met Ala Thr Leu Arg Glu Leu Phe Lys Glu 

560 565 570 

Gin Glu Val Pro Asp Pro Thr Lys Tyr Phe Val Thr Arg Trp Ser 

575 , 580 585 

Thr Asp Pro Trp He Gin Met Ala Tyr Ser Phe Val Lys Thr Gly 

590 595 600 

Gly Ser Gly Glu Ala Tyr Asp He He Ala Glu Asp He Gin Gly 

605 610 615 

Thr Val Phe Phe Ala Gly Glu Ala Thr Asn Arg His Phe Pro Gin 

620 625 630 

Thr Val T3ar Gly Ala Tyr Leu Ser Gly Val Arg Glu Ala Ser Lys 

635 640 645 

He Ala Ala Phe 



<210> 18 

<211> 258 

<212> PRT 

<213> Homo sc^iens 
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<220> 

<221> misc^feature 

<223> Incyte ID No: 55142051C©! 

<400> 18 

Met Ser Pro Ala He Ala Lea Ala Phe Leu Pro Leu Val Val ^hr 
15 10 15 

I»eu Leu Val Arg Tyr Arg His Tyr Phe Arg Leu Leu Val Arg Thr 
20 25 30 

Val Leu Leu Arg Ser Leu Arg Asp Cys Leu Ser Gly Leu Arg He 
35 40 45 

Glu Glu Arg Ala Phe Ser Tyr Val Leu 'ihr Hi's Ala Leu Pro Gly 
50 55 60 

Asp Pro Gly His He Leu Thr Thr Leu Asp His Trp Ser Ser Arg 
65 70 75 

cys Glu Tyr Leu Ser His Met Gly Pro Val Lys Gly Gin He Leu 
80 85 90 

Met Arg Leu Val Glu Glu Lys Ala Pro Ala Cys Val Leu Glu Leu 
95 100 105 

Gly Thr Tyr Cys Gly Tyr Ser Thr Leu Leu He Ala Arg Ala Leu 
110 115 120 

Pro Pro Gly Gly Arg Leu Leu Thr Val Glu Arg Asp Pro Arg Thr 
.125 . 130 135 

Ala Ala Val Ala Glu Lys Leu He Arg Leu Ala Gly Phe Asp Glu 
140 145 150 

His Met Val Glu Leu He Val Gly Ser Ser Glu Asp Val He Pro 
155 160 165 

cys Leu Arg Thr Gin Tyr Gin Leu Ser Arg Ala Asp Leu Val Lou 
170 175 180 

Leu Ala His Arg Pro Arg Cys Tyr Leu Arg Asp Leu Gin Leu Leu 
185 190 195 

Glu Ala His Ala Leu Leu Pro Ala Gly Ala Thr Val Leu Ala Asp 
200 205 210 

His Val Leu Phe Pro Gly Ala Pro Arg Phe Leu Gin Tyr Ala Lys 
215 220 225 

Ser Cys Gly Arg Tyr Arg Cys Arg Leu His His Thr Gly Leu Pro 
230 235 240 

Asp Phe Pro Ala He Lys Asp Gly He Ala Gin Leu Thr Tyr Ala 
245 250 255 

Gly Pro Gly 



<210> 19 

<211> 544 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Ijcicyte ID No: 7395274CD1 

<400> 19 

Met Ser Ser Pro Gly Pro Ser Gin Pro Pro Ala Glu Asp Pro Pro 
15 10 15 

Trp Pro Ala Arg Leii Leu Arg Ala Pro Leu Gly Leu Leu Arg Leu 

20 25 30 

Asp Pro Ser Gly Gly Ala Leu Leu Leu cys Gly Leu Val Ala Leu 

35 40 45 

Leu Gly Trp Ser Trp Leu Arg Arg Arg Arg Ala Arg Gly He Pro 

50 55 60 

Pro Gly Pro Thr Pro Trp Pro Leu Val Gly Asn Phe Gly His Val 

65 70 75 
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Leu 


Leu 


Pro 


Pro 


Phe 
80 


Leu 


Arg 


Arg 


Arg 


Ser 
85 


Trp 


Leu 


Ser 


Ser 


Arg 
90 


Thr 


Arg 


Ala 


Ala 


Gly 
95 


He 


Asp 


Pro 


Ser 


Val 
100 


He 


Gly 


Pro 


Gin 


Val 
105 


Leu 


Leu 


Ala 


His 


Leu 
110 


Ala 


Arg 


Val 


Tyr 


Gly 
115 


Ser 


He 


Phe 


Ser 


Phe 
120 


Phe 


He 


Gly 


His 


Tyr 
125 


Leu 


Val 


Val 


Val 


Leu 
130 


Ser 


Asp 


Phe 


His 


Ser 
135 


Val 


Arg 


Glu 


Ala 


Leu 
140 


Val 


Gin 


Gin 


Ala 


Glu 
145 


Val 


Phe 


Ser 


Asp 


Arg 
150 


Pro 


Arg 


Val 


Pro 


Leu 
155 


He 


Ser 


He 


Val 


Thr 
160 


Lys 


Glu 


Lys 


Gly 


Val 
165 


Val 


Phe 


Ala 


His 


Tyr 
170 


Gly 


Pro 


Val 


Trp 


Arg 
175 


Gin 


Gin 


Arg 


Lys 


Phe 
180 


Ser 


His 


Ser 


Thr 


Leu 
185 


Arg 


His 


Phe 


Gly 


Leu 
190 


Gly 


Lys 


Leu 


Ser 


Leu 
195 


Glu 


Pro 


Lys 


He 


He 
200 


Glu 


Glu 


Phe 


Lys 


Tyr 
205 


Val 


Lys 


Ala 


Glu 


Met 
210 


Gin 


Lys 


His 


Gly 


Glu 
215 


Asp 


Pro 


Phe 


Cys 


Pro 
220 


Phe 


Ser 


He 


He 


Ser 
225 


Asn 


Ala 


Val 


Ser 


Asn 
230 


He 


He 


Cys 


Ser 


Leu 

235 


Cys 


Phe 


Gly 


Gin 


Arg 
240 


Phe 


Asp 


Tyr 


Tflir 


Asn 
245 


Ser 


Glu 


Phe 


Lys 


Lys 
250 


Met 


Leu 


Gly 


Phe 


Met 
255 


Ser 


Arg 


Gly 


Leu 


Glu 
260 


He 


Cys 


Leu 


Asn 


Ser 
265 


Gin 


Val 


Leu 


Leu 


Val 
270 


Asn 


He 


Cys 


Pro 


Trp 
275 


Leu 


Tyr 


Tyr 


Leu 


Pro 
280 


Phe 


Gly 


Pro 


Phe 


Lys 
285 


Glu 


Leu 


Arg 


Gin 


He 
290 


Glu 


Lys 


Asp 


He 


Thr 
295 


Ser 


Phe 


Leu 


Lys 


Lys 
300 


He 


He 


Lys 


Asp 


His 
305 


Gin 


Glu 


Ser 


Leu 


Asp 
310 


Arg 


Glu 


Asn 


Pro 


Gin 
315 


Asp 


Phe 


He 


Asp 


Met 
320 


Tyr 


Leu 


Leu 


His 


Met 
325 


Glu 


Glu 


Glu 


Arg 


Lys 
330 


Asn 


Asn 


Ser 


Asn 


Ser 
335 


Ser 


Phe 


Asp 


Glu 


Glu 
340 


Tyr 


Leu 


Phe 


Tyr 


He 
345 


He 


Gly 


Asp 


Leu 


Phe 
350 


He 


Ala 


Gly 


Thr 


Asp 
355 


Thr 


Thr 


Thr 


Asn 


Ser 
360 


Leu 


Leu 


Trp 


cys 


Leu 
365 


Leu 


Tyr 


Met 


Ser 


Leu 
370 


Asn 


Pro 


A^ 


Val 


Gin 
375 


Glu 


Lys 


Val 


His 


Glu 
380 


Glu 


He 


Glu 


Arg 


Val 
385 


He 


Gly 


Ala 


Asn 


Arg 
390 


Ala 


Pro 


Ser 


Leu 


Thr 
395 


Asp 


Lys 


Ala 


Gin 


Met 
40D 


Pro 


l^yr 


Thr 


Glu 


Ala 
405 


Thr 


He 


Net 


Glu 


Val 
410 


Gin 


Arg 


Leu 


Thr 


Val 
415 


Val 


Val 


Pro 


Leu 


Ala 
420 


He 


Pro 


His 


Met 


Thr 
425 


Ser 


Glu 


Asn 


Thr 


Val 
430 


Leu 


Gin 


Gly 


Tyr 


Thr 
435 


He 


Pro 


Lys 


Gly 


Thr 
440 


Leu 


He 


Leu 


Pro 


Asn 
445 


Leu 


Trp 


Ser 


Val 


His 
450 


Arg 


Asp 


Pro 


Ala 


He 
455 


Trp 


Glu 


Lys 


Pro 


Glu 
460 


Asp 


Phe 


Tyr 


Pro 


Asn 
465 


Arg 


Phe 


Leu 


Asp 


Asp 
470 


Gin 


Gly 


Gin 


Leu 


He 
475 


Lys 


Lys 


Glu 


Thr 


Phe 
480 


He 


Pro 


Phe 


Gly 


He 
485 


Gly 


Lys 


Arg 


Val 


Cys 
490 


Met 


Gly 


Glu 


Gin 


Leu 
495 


Ala 


Lys 


Met 


Glu 


Leu 

500 


Phe 


Leu 


Met 


Phe 


Val 
505 


Ser 


Leu 


Met 


Gin 


Ser 
510 


Phe 


Ala 


Phe 


Ala 


Leu 
515 


Pro 


Glu 


Asp 


Ser 


Lys 
520 


Lys 


Pro 


Leu 


Leu 


Thr 
525 


Gly 


Arg 


Phe 


Gly 


Leu 
530 


Thr 


Leu 


Ala 


Pro 


His 
535 


Pro 


Phe 


Asn 


He 


Thr 
540 



22/36 



wo 02/12467 



PCT/USOl/24382 



lie Ser Arg Arg 



<210> 20 

<211> 2603 

<212> nNA 

<213> Hcmo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 72482 85CB1 

<400> 20 

atggcgtggt ccccaccagc caccctctht 
cccagcaggc cacagbcact gggcaccact 
ccagaggagg gccgcctgga ggtgctgcac 
aactttgcta tccaggaggc cacagtggct 
acctgggccc acagtgcceia gtiacggccaa 
cgctgtgtgg gcacagagag ctccttggac 
gactgcagtc actcagaaga cgtaggggtg 
ctttctgaaa ctgtctcceia tgcccttggg 
ctcaagccca tccttgccag tgccaagcag 
gtgaagtatg agggccactg gcggcaggtg 
agggtggtgt gcgggatgct gggcttcccc 
aggaaagtct gggatctgaa gatgagggac 
aagaactcct tctggatcca ccaggtcacc 
tgccaggtgc aggtggctcc agcccggggc 
catgctgtgg tcagctgtgt ggcagggcct 
cgcaaagggt cctgggcaga ggagccgagg 
gagggccggg fcggaagtgct catgaaccgc 
aacctcatct ctgccagtgt cgtgtgtcgt 
ctctttgggg cccggctggg ccaagggcta 
aggggatatg agcggaccct cagcgactgc 
caacatgaga atgatgctgc tgtcaggtgc 
gtgcgcttgg ctggtgggcg tetccctgag 
aacggggtcc cacgctgggg gagcgtgtgc 
gtggcctgcc gacagctcgg cctgggtttt 
tggtcgggga cgccaagggc ccaggaggtg 
gagctggccc tgcagcagtg ccagaggcac 
cgcttcctgg ctggagtctc ctgcatggac 
ctagtgcagg agacggccta cttggaggac 
gaggagaact gcctctccaa gtctgcggat 
ctattgcgct tctccacaca gatctacaat 
ggacgcgata gctgggtttg gcaccagtgc 
acccactacg acctcctcac tctcaatggc 
ttctgtctgg aggacacaaa ctgccccaca 
tttggagaac agggagtgac tgtaggctgc 
cagtgggtgg atatcacaga tgtgggcccc 
ccccactatg aagtggcaga gtcagatttc 
tatgatgggc accgggtctg gctgcacaac 
gcagaactct ccctggagca ggaacagcgt 
tgcacactcc tagctgctgc cgatacacca 
cacagagkcc Cciactcagag gaaaagggcc 
gaagcctttt gatggcaaga tcaccaatcc 
ggcctgcccc taagggcctg tggcctatgg 
ctcctcttct gtaaggaaac ccagtcatcc 
ggagctctca gtttcttaag gag 

<210> 21 

<211> 1745 

<212> UNA 

<213> Homo sapiens 



ctgttcctgc tgctgctagg ecagccccct 60 
aagctccggc tggtgggccc agagagceiag 120 
cagggccagt ggggcaccgt gtgtgatgac 180 
tgccgccagc tgggcttcga agctgccttg 240 
ggggagggac ccatctggct ggacaatgtg 300 
cagtgcgggt ctaatggctg gggagtcagt 360 
atatgccacc cccggcgcca tcgtggctac 420 
ccccagggcc agcggctgga ggaggtgcgg 480 
catagcccag tgaccgaggg agccgtggag 540 
tgtgaccagg gctggaccat gaacaacagc 600 
agcgaggtgc ctgtcgacsig ccactactac 660 
cctaagtcta ggctgaagag cctgacgaab 720 
tgcctgggga cagagcccca catggccaac 780 
aagctgcggc cagcctgccc aggtggcatg 840 
cacttccgcc caccgaagac aaagccacaa 900 
gtgcgcctgc gctccggggc ccaggtgggc 960 
cagtggggca cggtctgtga ccacaggtgg 1020 
cagctgggct ttggctctgc tcgggaggcc IQBO 
gggcccatcc^acctgagtga ggtgcgctgc 1140 
cctgccchgg aagggtccca gaatggttgc 1200 
aatgtcccta acatgggctt tcagaatcag 1260 
gaggggctat tggaggtgca ggtggaggtg 1320 
agtgaaaact gggggctcac cgaagccatg 1380 
gccatccatg cctacaagga aacctggttc 1440 
gtgatgagtg gggtgcgctg ctcaggcaca 1500 
gggccggtgc actgctccca cggtggcggg 1560 
agtgcaccag acctggtgat gaacgcccag 1620 
cgcccgctca gccagctgta ttgtgcccac 1680 
cacatggact ggccctacgg ataccgccgc 1740 
ctgggccgga ctgactttcg tccaaagact 1800 
cacaggcatt accacagcat tgaggtcttc 1860 
tccaaggtgg ctgaggggca caaggccagc 1920 
ggactgcagc ggcgctacgc atgtgccaac 1980 
tgggacacct accggcatga cattgattgc 2040 
gggaabtata tcttccaggt gattgbgaac 2100 
tccaacaata tgctgcagbg ccgcbgcaag 2160 
bgccacacag ggaattcaba. cccagccaat 2220 
cbcaggaaca acctcabcbg aagcbgtcac 2280 
gabaccbcag cbbatbggag ccabgccctb 2340 
agbgccaagg ggcaccaaga acctgctcag 2400 
agatggtabt gcbcccbcag gatggcbcbg 2460 
aatatgtcct ccaggctttg ctcagctgag 2520 
cbgaatcbbg ccacagagab ccgggabbca 2580 

2603 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 7472835CB1 



<400> 21 

gtaatttttt ttaaatggga tggatgacag 
tgagacacaa tggccaaaaa agcgattgct 
gccatcaagt gctgcctgga tgaagatctg 
attggacatc tctggaaatt tcaaaaaaat 
tctgtgacca tcaatacttc caaggagatg 
cattttccca actacatgca caactccaaa 
cactttggcc tcctgaatta cattcgtttt 
ccagattttt ctatcaatgg acaatgggat 
actttggtct ttgatggggt cttagtttgc 
cttcagtcct tcccaggtat ggagaaattt 
aaaagtcccg aggacttttc cigggaaaaga 
gtggatattg cggtggagct cagtcgtgta 
cgtggatcat ggattttaca ccgtgtttgg 
ttcactcggt tcaatagttt tctccagaaa 
ctagagaaaa taatgaactc aagatttaat 
agagctttaa gtcagcatcc aactgtcagt 
aaagtccaag taaagcccag cgtgaaggag 
agcactgtag aggagaatat tgatgttgtc 
tictttccttg atggtctgat caaggttact 
ttccctcctg acctggagaa gccaaccttg 
atcatcttac ctattgcaga gctccaatct 
atcaaattac cctcagcgga gaacatgatg 
gaaaaacggt atgtaaagac accccgccac 
gatgagattg ccatgccagc aggggtgaaa 
ccaaagctgg ccatggaggt tttctttggc 
gggcccgaga aatgggatgg ggcccggaga 
aagcccctga ggactcgcat tactagtgag 
ataa£Lgatgg ccccagtgag cctggcattt 
actccttacg gtaaatggaa ataaatgaaa 
ttcta 

<210> 22 

<211> 1587 

<212> DMA 

<213> Hotmo sapiens 



tgacagagct ctaaatttct actaaccagc 60 
gtgattggag ctggaattag cggactgggg 120 
gagcccacct gctttgaiaag aaatgatgat 180 
acttcagaga aaatgcctag tatctacaaa 240 
atgtgcttca gtgacttccc tgtccctgat 300 
ctcatggact acttcgggat gtatgccaca 360 
aagactgaag tgcaaagtgt gaggaagcac 420 
gttgttgtgg agactgaaga. gaaacaagag 480 
agtggacacc acacagatcc ctacttacca 540 
gaaggctgtt atttccatag tcgggaatac 600 
atcatagtga tcggcattgg aaattctgga 660 
gcaaaacagg tbatattcct tagtactaga 720 
gataatgggt abcccatgga tagttcattt 780 
atactaacta caccacaaat aaataaccag 840 
catgcgcact gtggcctgca gcctcagcac 900 
gatgacctgc caaatcacat aatttctgga 960 
ttcacagaaa cagatgccat ttttgaagac 1020 
atctttgcta caggatacag tttttctttt 1080 
aacaatgaag tatctctgta taagcttatg 1140 
gctgtcatcg gtcttatcca accactgggc 1200 
cgttgggcta cacgagtgtt caaaggtctg 1260 
gcagatattg cccagaggaa aagggctatg 1320 
acaatccaag tggatcacat tgagtacatg 1380 
cccaacctgc tcttcctctt tctctcagat 1440 
ccctgcaccc cataccagta ccaccbccat 1500 
gctaacctga cccagagaga gaggatcatc 1560 
gacagccacc catcctcaca gctctcttgg. 1620 
ctggctgctg gctfcggcata ctttcgatat 1680 
gaacactgag ggggaaaagc atggaatagt 1740 

1745 



<220> 

<221> misc^featvire 

<223> IncytQ ID No: 7476203CB1 

<400> 22 

atgctctccc tgctcagtgg gctggcttta 
ggcaccttct gctgggacag gagctgtctt 
ggaaacctgt ggcagetatg ctttcagcag 
aagataggaa atatctttat gaacttggga 
ttaccggtga ctttccatcc cttgaaccaa 
cctaeiaccct tccctaccag aaatccaggc 
caaaagagac gcttctgcct ggtgatgatt 
gaggtgcagc tgcagaaaga ggcagcagag 
aagagaccct tcgaccctca ggtatccatt 
cttgtgtttg gccaccactt cctcttagag 
atcgactttg gcctggcatt tgtcagcact 
tgggccctct gccacct ccc aggaccccac 
ctgagcttaa tccaccagga gatcaccagg 
gacttcatca gctgctacct ggcccagatc 
ttcaaccagg agaacctggt ccaggtggtg 
acagccacca ccctgtgctg ggcactcatc 



ctggccatct ccttcctgct cctgaaactt 60 
cctcctggcc cacteccctt ccccatcctt 120 
cctcaccttt cacttaaaaa ctttcagaag 180 
agcagtgtgg ttccactggc attgccattg 240 
ggcgtgttat gtaagccact tattactttc 300 
atcatctgca gcagcgggca cacgtggcgg 360 
cgagggctgg gcctaggcaa gctggc3gctg 420 
ctggcagaag ccttccgcca ggagcagggt 480 
gtcaggtcca cagtcagagt catcggggcc 540 
gatcccatct tccaggaact gactcaagcc 600 
gtgtggcgcc agctgtatga cgtgtttccc 660 
caggagatat ttaggtacca agaggtcgtg 720 
cacaaactca gggcaccgga ggcccccagg 780 
tccaaggcca tggatgaccc tgtctccaca 840 
atcgacctgt ttctgggagg caccgacacc 900 
cacatgatcc agcacggagc tgtccaggag 960 
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acggtgcagc tggagctgga 
cgcaagcgac tgccttacac 
tggccatggg tgccgtgcgc 
gcaagggcac catcatctta. 
agacccctcg acagttcaac 
atgaggcctt cctgccattc 
aaatggagct cttcctgatg 
aagggagccc ggggctcaag 
aggagatctg cgcagtgccc 
tgtagccact gggggtctgg 
ttcctgcaaa gttagaeiaag 

<210> 23 
<211> 1038 
<212> nNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 
<223> mcyte ID No: 7478583CB1 

<400> 23 

atgaaagcag cagtttggta tggtcaaaaa gatgtaagag ttgaagaacg tgaacctaaa 60 
gaattacagg acaatgaagt taaggttaaa gtatcttggg ctggcatttg tggtacagat 120 
ttacatgaat acttagaagg ccctatattt atttcaacag aaaagccaga tccattctta 180 
ggtcaaaaag cgccagttac attgggtcat gaatttgcag gtgtagtaga agaaactggt 240 
tcccaagtta caaaatttaa taaaggcgat cgagttgtag ttaaccctac agtttcaaaa 300 
cgtgaaaaag aagaaaatat tgacctttat gatggttatt catttatagg cttaggttct 360 
gatggtggat ttgcagagtt tacaaatgcg ccggaagaaa atgtttataa actaccagat 420 
aatgtttctg ataaagaagg tgcgcttgtc gaaccaacag ccgttgcagt tcaagcaatt 480 
aaagaaggtg aagttctatt tggtgatact gtagctattt ttggtgcagg accaattgga 540 
ttattaacag tcgtagcagc caaagcagct ggtgcaagta aaatatttgt tttcgattta 600 
tcagaagaaa gactaagtaa agctaaagca ctaggcgcaa ctcatgctat aaactctggt 660 
aagacagatc cagttgatgt tattaatgag tatacagaaa atggtgtaga tgtatctttt 720 
gaagtggctg gtgtagcacc aacacttaaa tcttctatag atgttacaaa agcaagaggt 780 
acagttgtta tcgtttctat ttttggtcat cctatcgagt g^aatccaat gcaattaact 840 
aatacaggag taaaacttac ttctacaatt gcatacacac ctactacatt ccaacaaaca 900 
attgacttaa tcaacgaagg taatttaaac gttaaagatg tagttactga tgaaattgag 960 
ttagaaaata tcgtagaatc aggatttgaa caacttgtaa atgataaatc tcaagcaaaa 1020 
atattaatta aattataa 1^^® 

<210> 24 
<211> 1584 
<212> DNA 

<213> Hamo sapiens 
<220> 

<221> inisc_f eatiire 

<223> locyte ID No: 7478585CB1 

<400> 24 

atgaaatcat taatgattgg cgctaatggt ggcgtcggtc aacatctcgt acgtaaacta 60 
aatgcacgag atgttgattt tacggccggt gttagaaaag aagaacaagt tgaagcttta 120 
aaagcagatg gtatcgatgc ciacttacatt gatgttgcaa aacaatctat tgatgaatta 180 
atagaattat ttaaaccgta tgaccaaatc cttttttctg tcggttctgg tggaagcaca 240 
ggtgacgatc aaacaatcat agtagattta gacggttcag tgaaagcaat taaagcaagt 300 
gaacacgtcg gtcgtcaaca ctttgttatg gtatcaacgt acgattcacg tcgagaagct 360 
tttgatgcgt caggcgactt gaaaccatac accattgcta aacattatgc ggatgactac 420 
ttaagacatg caaatttaaa atataccatc gtacatccag gcgcattaac taacaatcat 480 
gaaacgcaac aattcaatat gagtgctcaa tttgaaaatg tacaaaatcc gtctatcacg 540 
agagaagatg tagcagaagt gcttgtttct gtgttaactg atgaaacatt acaagttgta 600 
cttgcaaaac gaccacaaag tatccctceia gacgatgtat ttagatttga aacaatagaa 660 



cgaggtgctg ggtgctgccc 
catgctgtcc tccatgacgt 
cagtgtgtga cctccacccg 
cccaacctgg cctctgrtgct 
cctggccact tctcggacaa 
tctgcaggta ctagggtcta 
tttgccaccc tcctcaggac 
ctggagtaca tctttggcgg 
cgcctgagca gccccagccc 
aggcctgtcc cccatgaagt 
aggagga 



cggttgtctg ctatgaeigac 1020 
gcagcgcctc agcagcgtca 1080 
tgtgtgcagc tatcccgtga 1140 
ctatgaccct gagtgctggg 1200 
ggatggaaac tttgtggcca 1260 
cccagcagac cagctggctc 1320 
ctttcggttc caactgccag 1380 
cacttggcaa ccccagcccc 1440 
tggtcctagg gaggatggcc 1500 
ccttcctcag tctcttttgg 1560 

1587 
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actcgagaac cacatgcagg tgaggttcaa 
tacatgagag gcagaatgaa tgatacaaaa 
ccattacaag gtcatattgt tggaaaagtc 
ggcgattatg tcacaggcat attaccatgg 
gtgacccctg tgccatcaaa agatgtacca 
ccgggaatga cggcatatac aggattgctt 
gttgtcgtgt cagctgcatc aggtgcagta 
aaaggcgcaa aagttgtcgg tattgctggt 
gaattaggat ttgatgcggc cattgactat 
gcggctgtac cagatggtat tgatgtgtat 
gaagtgttta aacacttaeia tcgatttgca 
tataataatg aaaaagacga tattggacca 
gcattgatgc aaggttttgt agtagcacaa 
caactcgcac aatgggtgtc tgeuiggtaaa 
tttgacaatt taccttctgc attcagaaag 
gttgtcaaaa tcaaagaaga atag 



gtagaatcca tttatgtatc tgtagatcct 720 
agttatgttc aacctttcca agtgaatgaa 780 
acacaatcga acgatgaacg tctatcfcgtt 840 
aaaaagataa atacagtgaa tggagacgat 900 
ttacatttat atttgagtgt tttaggcatg 960 
caaattggtc aaccacaatc tggcgagacg 1020 
ggatctgtcg taggacaaat tgctaagatt 1080 
ggtaagcaga aaacaacata tttaacagat 1X40 
aaacaagatg atttcgcaca gcaactcgaa 1200 
tttgaaaatg taggcggcgt aatttctgat 1260 
cgcgttccgg tatgtggtgc aatttcagca 1320 
cgtatccaag gaacgttgat taaaaabcaa 1380 
ttcgctgatc attttaaaga agcaagcgaa 1440 
attaaattfcg aagtgacgat agatgaaggt 1500 
ttatttacag gtgataattt cggtaaacaa 1560 

1584 



<210> 25 
<211> 150D 
<212> mik 

<213> Hcsmo sapiens 
<220> 

<221> misc.fdature 

<223> Incyte ID No: 7479904CB1 



<400> 25 

atggatgaga aatccaacaa gctgctgcta gctttggtga tgctcttcct atttgccgtg 60 
atcgtcctcc aatacgtgtg ccccggcaca gaatgccagc tcctccgcct gcaggcgttc 120 
agctccccgg tgccggaccc gtaccgctcg gaggatgaga gctccgccag gttcgtgccc 180 
cgctacaatt tcacccgcgg cgacctcctg cgcaaggtag acttcgacat caagggc?gat 240 
gacctgatcg tgttcctgca catccagaag accgggggca ccactttcgg ccgccacttg 300 
gtgcgtaaca tccagctgga gcagccgtgc gagtgccgcg tgggtcagaa gaaatgcact 360 ' 
tgccadcggc cgggtaagcg ggaaacctgg ctcttctcca ggttctccac gggctggagc 420 
tgcgggttgc acgccgactg gaccgagctc accagctgtg tgccctccgt ggtggacggc 480 
aagcgcgacg ccaggctgag accgtccagg tggaggattt ttcagattct agatgcagca 540 
agtaaggata aacggggttc tccaaacact aacgcaggcg ccaactctcc gtcatccaca 600 
aagacccgga acacatctaa gagtgggaag aacttccact acatcaccat cctccgagac 660 
ccagtgt ccc ggtacttgag tgagtggagg catgtccaga gaggggcaac atggaaagca 720 
tccctgcatg tctgcgatgg aaggcctcca acctccgaag agctgcccag ctgctacact 780 
ggcgatgact ggtctggctg ccccctcaaa gagtttatgg actgtcccta caatctagcc 840 
aacaaccgcc aggtgcgcat gctctccgac ctgaccctgg taggctgcta caacctctct 900 
gtcatgcctg aaaagceiaag aaacaaggtc cttctggaaa gtgccaagtc aaatctgaag 960 
cacatggcgt tcttcggcct cactgagttt cagcggaaga cccaatatct gtttgagaaa 1020 
accttcaaca tgaactttat ttcgccattt acccagtata ataccactag ggcctctagt 1080 
gtagagatca atgaggaaat tccteiaagcgt attgagggac tgaattttct ggatatggag 1140 
ttgtacagct atgccaaaga cctttttttg cagaggtatc agtttatgag gcageiaagag 1200 
catcaggagg ccaggcgaaa gcgtcaggaa caacgcaaat ttctgaaggg aaggctcctt 1260 
cagacccatt tccagagcca gggtcagggc cagagccaga atccgaatca gaatcagagt 1320 
cagaacccaa atccgaatgc caatcagaac ctgactcaga atctgatgca gaatctgact 1380 
cagagtttga gccagaagga gaaccgggaa agcccgciagc agaactcagg caaggagcag 1440 
aatgataaca ccagcaatgg caccaacgac tacataggca gtgtagagaa atggcgtifcaa 1500 

<210> 26 

<211> 669 

<212> HNA 

<213> Homo sapietis 

<220> 

<221> iiidsc_£eatnire 

<223> Incyte ID No: 74803 67CB1 
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<400> 25 

atggcagaga agcccaagct ccactactcc 
tggctcctgg ctgcagctgg agtagagttg 
ttggacaagt taagaaatga tgggagtttg 
gacgggatga agctggtgca gaccagagcc 
ctttatggga aagacatgaa ggagagagcc 
gatttgactg aaatgatcct tcttctgctc 
actgcctbgg tcaaagageia aataaaaaat 
aagagccaca gacEiagacta ccttgttggc 
gtggaacttt tctactacgt ggaagagctt 
ctgaaggccc tgaaaaccag aatcagcaac 
ggcagccaga gaaagcctcc catggatgag 
aggttttaa 

<210> 27 
<211> 3551 
<212> DMA 

<213> HoBDD sapiens 
<220> 

<221> misc^feature 

<223> Iticyte ID No: 8069390CB1 

<400> 27 

ggcgcggagc agctccggcg gcgagacggg 
tttggaagac tttgccggcc tgcagattgg 
tgacjggccca gaacfcggcag agagaaggtt 
ttgtacaggg aaatcgccag gtcttgcaat 
gcctggtata cggccagaaa aagcatcact 
ctgcatttta tcccccgcct ggggagcaga 
gccgttgtca gcggtgcaac agatgggatt 
cgaggtctca atataatcct gattagtcgg 
gacatagccg acacgtacaa agtggaaact 
cgtgagatct accttccaat tcgagaagcc 
aataacgtgg gtgtgtttta tccctacccg 
ctctgggaca tcataaatgt gaacattgcc 
ccgggaatgg tggagagaaa gaaaggtgcc 
aaacccactc ctcagctggc tgcattttct 
agagccttgc aatatgaata tgcctctaaa 
tatgtagcca ccagcatgac agcacccagc 
ccttcgccaa aagtctatgc acatcatgct 
acaggatatt ggtcccattc tattcagttt 
tgggtgtggg gagcaaatat tctcaaccgt 
gcctgagtct ggatggccac ttgagaagtt 
acatttggaa aaacacattt aatttatctc 
tgttgatt eg tcatttgcaa aacacacata 
gggtgtgtgg atggcacagg atcaataatg 
agtgcttttt ccctaagctg tttgaaagtt 
gtctaatatt gaaatataat atgatttgtc 
agggaaattg accatttcac tagaggagtt 
actactgaaa agtattattt tgtaactaag 
gtattgtata gctctagctg tttgactatc 
tttcattaga gaaccgtgtg tgtgcgtgtg 
ttaccatatt tttaaaaaga ttfctttgtca 
ttciaaacaat ctagatbatt ccttttcatc 
ggtttgccat gaatgatatt acttcctaca 
gaatgaactt attcctagat ttggatatgt 
taaggcaatt cttgtggata aatagtgatt 
atttagccct ttagagcttg ttatatctgg 
ccattccagt agcattattg atagaaatag 
aeiattacgtg tgtgacatat ctgtcciaaat 
tgaatttatg tattgtagtg aataccttta 



aatgcacggg gcagtatgga gtccattcgg 60 
gaagagaaat ttctagaatc tgcagaagat 120 
ctgttccagc aagtaccaat ggttgagatt 180 
attcttctact acattgccag caaatacaac 240 
ctgattgata tgtacacaga aggtatagta 300 
atatgtcaac cagaggaaag agatgccaag 360 
cgctacttcc ctgcctttga aaaagtatta 420 
aacaagctga gctgggctga cattcacctg 430 
gactcgagtc ttatctccag cttccctctg 540 
ctgcccacgg tgaagaagtt tctgcagcct 600 
aaatctttag aagaagcaag gaagatttkc 660 

669 



ggcggcgccg cgcgggtctg gcgggaccgg 60 
ccttaagaga aggacggagc cacatactgc 120 
gccatggctg ctgttgacag tttctacctc 180 
tgctatatgg aagctctagc tttggttgga 240 
gtcatctgtg acttttacag cctgatcagg 300 
gcagacttga tcaagcagta tggaagatgg 360 
ggaaaagcct acgctgaaga gttagcaagc 420 
aacgaggaga agttgcaggt tgttgctaaa 480 
gatattatag ttgcggactt cagcagcggt 540 
ctgaaggaca aagacgttgg catcttggta 600 
cagtatttca ctcagctgtc cgaggacaag 660 
gccgctagtt .tgatggtcca tgttgtgtta 720 
atcgtcacga tctcttctgg ctcctgctgc 780 
gcttctaagg cttatttaga ccacttcagc 840 
ggaatctttg tacagagtct aatccctttc 900 
aactttctgc acaggtgctc gtggttggtg 960 
gtttctactc ttgggatttc caaaaggacc 1020 
ctttttgcac agtatatgcc tgaatggctc 1080 
tcactacgta aggaagcctt atcctgcaca 1140 
ttgccaactc ctgggaacct cgatattctg 1200 
ctgtgtttca ttgctgatta ttcagcatac 1260 
ataccgtcag agtgctgtga aaaaccttaa 1320 
cctgaggctg attgacgaca tctacatttc 1380 
acgcttttct gttgttctag agccacagca 1440 
aggtcttata atttcagatg ttgtttttta 1500 
gtgctggttt ttaaatgtgc atcaagaaag 1560 
attgctggta ctattaggaa aaatctgtgt 1620 
tgtaatgaaa atgctgcact tcaactggta 1680 
tgtggtgcct ttgagcaact ttatttatgg 1740 
gggtgactta acatggactc ttatagggta 1800 
cteiaataagc ctaccaaatt tcatgctgtt 1860 
ttatatttgt gttttttcaa atctgctatg 1920 
aagagaaacc tgcagtcatc ttttgattta 1980 
tctcagcctc tgacccattt tataactgaa 2040 
ttttcctacg tttttctatg taatattatt 2100 
taagtattta tggaatagta aaatatggac 2160 
aagttagaag cttattcttg gtttgtgtaa 2220 
ctggtgtgaa gataattatg cacaaaccct 2280 
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cacaatacgc gttaacattg aaacctgtga aatgtcctta ggttgggtca tataaagcca 2340 
accatttttg aggaccatgt acctagtgct ttgaaaactg taagtcacta tatgaatatg 2400 
acaatatgtg cacatttaaa attcagagct cggcattgtg atactgatgc agaagctagt 2460 
agattggtta aaagtctgga cttctgtggc atttttttcg tgacgtgata atctatcata 2520 
agcagaccta agcacagttt tatgaacaca attttgccca tgacattgcc tacaggattt 2580 
ccagatgtga cttgcactca gaagatcagt ggtcaacttc agaagttctt ccacgcttag 2640 
atcatgtctt cagaacttag atgtgaaaiat ctacacactg ggagatgctg tgagccccaa 2700 
ggttttgatg gagtttgctt ggaatcctct tgacttcatg ccacafctgac gtgaactttg 2760 
atgtatciata agcagcagca acttcatgtg aaaatatggt caggtagtta tatgtaaggt 2820 
tacgtggtcc agtaatgtct tagattgata aattaggtat ggaatccatc agtgttacgt 2880 
gatgagaata ggtgaacaca ccttgtcagt gatgatgtaa acttctctcc ttggcaggac 2940 
atgggcaaac atgctgattg gtgcaaatgt ggtgccgagc tgtccatagc tgcagtgaaa 3000 
gatgaagagc aagaccttct ctaggttttc tagctttcat taaatgtatt tttttcccca 3060 
gagctaattt gaaagttgat tggaccactg tggatggggt ctcattaaga atgtgggaaa 3120 
taggggccga gtgcggtggc tcacacctgt aatcccagca gtttggaagg ccagggcagg 3180 
tggatcgctt gatcccagga ggtcgagacc agcctgggga acacatcctg tctctacaaa 3240 
aaatacaaaa attagccagg cagggtggtg catgcctgta gtcccagcta cttgggaggc 3300 
tgaggcagga gaattttttg agcccaggat gcagaggttg eiagtgagcca agatcgtgcc 3360 
actgcactcc agccttgaga cagagcgaga ccctgtctca aaaaaaaaaa aagaacgtgg 3420 
gaaatatgaa cctttgaaag ttaatctgtg aattgaaagt ttaacaataa aagtagttgt 3480 
ttgtttcctt tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaatggcgg 3540 
tcgcaagctt a 3551 

<210> 28 . . 

<2ai> 2178 

<212> DMA 

<213> Hoodoo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 7473869CB1 

<4JD0> 28 

tcgggtgggg agtagagtag gtaagcgtgt ttctttattg gcggagggta tgtgtaaccg 60 
agagggtttc actgccaatt atggaagtag cctctccttg atattgttga ggtgggattg 120 
tgcaacaaca ttgtactgtt gtgggttgta atttcgccta gatggcataa gtgagaactg 180 
aagacttcag gtaggatcag aggcacgggt cttgttagtt atgctcttgg gatcagtgtc 240 
gcctgctgag ctgaaaagga aatggattca atctcttcaa cctttaaggt gatagatagt 300 
ttgagcagac tggagaatgg acacactatg aagctgtggc tagaaaggga ctggtcatgt 360 
cccatcctct ggccagattg actggggatg tccggacaga tgcctgcatg ggtggtgagg .420 
gccacatctg cacacgagcc agtggctgct tgcagttcac tgctgtgatg ccagagtgtg 480 
ttc£iaaggtg actctcctgc tcttctggac tcttctctca ggcaagaaag gctgcaggct 540 
gcctgctatg tgatgcctga gcacaaagcc aaggaactga actaagtctt tctgtteiagt 600 
cctgagtttg tcattggcag ghttacttgt ggccagctct ctctgccctt gggggttccg 660 
tcttctcact gcggaccctg gattgaaacg atctccccgc ggccgccgcc gctacctggt 720 
gcccgcaggt gcctgcagga gtcctggggc cagctggcct cgatgtacgt cagcacgcgg 780 
gaacggtaca agtggctgcg cfctcagcgag gactgtctgt acctgaacgt gtacgcgccg 840 
gcgcgcgcgc ccggggatcc ccagctgcca gtgatggtct ggttcccggg aggcgccttc 900 
atcgtgggcg ctgcttcttc gbacgagggc tctgacttgg ccgcccgcga gaaagtiggtg 960 
ctggtgtttc tgcagcacag gctcggcatc ttcggcttcc tgagcacgga cgacagccac 1020 
gcgcgcggga actgggggct gctggaccag atggcggctc tgcgctgggt gcaggagaac 1080 
atcgcagcct tcgggggaga cccaggaaat gtgaccctgt tcggccagtc ggcgggggcc 1140 
atgagcatct caggactgat gatgtcaccc ctagcctcgg gtctcttcca tcgggccatt 1200 
tcccacfagtg gcaccgcgtt attcagactt ttcatcacta gtaacccact geiaagtggcc 1260 
aagaaggttg cccacctggc tggatgcaac cacaacagca cacagatcct ggtaaactgc 1320 
ctgagggcac tatcagggac caaggtgatg cgtgtgtcca acaagatgag attcctccaa. 1380 
ctgaacttcc agagagaccc ggaagagatt atctggtcca tgagccctgt ggtggatggt 1440 
gtggtgatcc cagatgaccc tttggtgctc ctgacccagg ggaaggtttc atctgtgccc 1500 
taccttctag gtgtcaacaa cctggaattc aafctggctct tgccttatat catgaagfctc 1560 
ccgctaaacc ggcaggcgat gagaaaggaa accatcacta agatgctctg gagtecccgc 1620 
accctgttgg tgagggaccc agctggcagg ggtgctcagt tcggacaggg ttgacccccc 1680 
tgtttttttt aacctagtag ctgctctttg caaaggggct cccagccagg gtaaggatct 1740 
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ttcttggagg gtctggggtt 
gggtcacagg ggcagttatg 
tcccaggaag ctccctttct 
atccaagtca ctggtgggcc 
ggcctcctca ctcagaatat 
gacaatgtca atgagcatga 
gatgccactt tcgtgtatgc 
cctgtctacc tgtatgaa 

<210> 29 
<211> 2081 
<212> DNA 
<213> Hooaao sapiens 

<220> 

<221> miscjeature 
<223> Incyte ID No: 747&588CB1 

<400> 29 

gctgtgggct ggtcagaagc tggttacaat tccccccgcc ccagtacttg ctggcaggga 60 
ttaagagcag ataa«iagtgt gctcacacac tgtagacacg gctaccatgc catccacagt 120 
gttgccatcc acagtgttgc catcactcct gcccacagca ggagctggct ggagcatgag 180 
gtggattctg tgctggagcc tcaccctctg cctgatggcg cagacggcct tgggtgcctt 240 
gcacaccaag aggcctcaag tggtcacceia atatggaacc ctgcaaggcia aacagatgca 300 
tgtggggaag acacccatcc aagtcttttt aggagtcccc ttctecagae ctcctctagg 360 
tatcctcagg tttgcacctc cagaaccccc ggagecctgg aeiaggaatca gagatgctac 420 
cacctacccg cctgggtgcc tgcaggagtc ctggggccag ctggcctcga tgtacgtcag 480 
cacgcgggaa cggtacaagt ggctgcgctt cagcgaggac tgtctgtacc tgaacgtgta 540 
cgcgccggcg cgcgcgcccg gggatcccca gctgccagtg atggtctggt tcccgggagg 600 
cgccttcatc gtgggcgctg cttcttcgta cgagggctct gacttggccg cccgcgagaa 660 
agtggtgctg gtgtttctgc agcacaggct cggcatcttc ggcttcctga gcacggacga 720 
cagccacgcg cgcgggaact gggggctgct ggaccagatg gcggctctgc gctgggtgca 780 
ggagaacatc gcagccttcg ggggagaccc aggaaatgtg accctgttcg gccagtcggc 840- 
gggggccatg agcatctcag gactgatgat gtcaccccta gcctcgggtc tcttccatcg 900 
ggccatttcc cagagtggca ccgcgttatt cagacttttc atcactagta acccactgaa 960 
agtggccaag aaggttgccc acctggctgg atgcaaccac aacagcacac agatcctggt 1020 
aaactgcctg agggcactat cagggaccaa ggtgatgcgt gtgtccaaca agatgagatt 1080 
cctccaactg aacttccaga gagacccgga agagattatc tggtccatga gccctgtggt 1140 
ggatggtgtg gtgatcccag atgacccttt ggtgctcctg acccagggga aggtttcatc 1200 
tgtgccctac cttctaggtg tcaacaacct ggaattcaat tggctcttgc cttatatcat 1260 
gaagttcccg ctaaaccggc aggcgatgag aaaggaaacc atcactaaga tgctctggag 1320 
tacccgcacc ctgttgaata tcaccaagga gcaggtacca cttgtggtgg aggagtacct 1380 
ggacaatgtc aatgagcatg actggaagat gctacgaaac cgtatgatgg acatagttca 1440 
agatgccact ttcgtgtatg ccacactgca gactgctcac taccaccgag atgccggcct 1500 
ccctgtctac ctgtatgaat ttgagcacca cgctcgtgga ataatcgtca aaccccgcac 1560 
tgatggggca gaccatgggg atgagatgta cttcctcttt gggggcccct tcgccacagg 1620 
cctttccatg ggtaaggaga aggcacttag cctccagatg atgaaatact gggccaactt 1680 
tgcccgcaca ggaaacccca atgatgggaa tctgccctgc tggccacgct ac£iacaagga 1740 
tgeiaaagtac ctgcagctgg attttaccac aagagtgggc atgaagctca aggagaagaa 1800 
gatggctttt tggatgagtc tgtaccagtc tcaaagacct gagaagcaga ggcaattcta 1860 
agggtggcta tgcaggaagg agccaaagag gggtttgccc ccaccatcca ggccctgggg 1920 
agactagcca tggacatacc tggggacaag agttctaccc accccagttt agaactgcag 1980 
gagctccctg ctgcctccag gccaaagcta gagcttttgc ctgttgtgtg ggacctgcac 2040 
tgccctttcc agcctgacat cccatgatgc ccctctactt c 2081 

<210> 30 
<211> 2642 

<2i2> rm 

<213> Hcano sapiens 
<220> 

<221> iiiisc_f ©ature 



tgctgtggga tcagatgact gcttacaggt aaggtgctca 1800 
cagcaaaatc aggggttaca atcagcagag acagaaactt 1860 
ccccctccca ggccaaaaac tcctgggggg ctgagcatgg 1920 
cacctctggc ccagctggca cccaggcctc aggtaagtgt 1980 
caccaaggag caggtaccac ttgcggtgga ggagtacctg 2040 
ctggaagatg ctacgaaacc gtatgatgga catagttcaa 2100 
cacactgcag actgctcact accaccgaga tgccggcctt 2160 

2178 
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<223> Tucyte ID No: 55046125CB1 
<400> 30 

gagccggccc gggtcggtcg ccgcctcagt tcgcgcgggc cctcctaggg gtgtgtctca 60 

cggattccac acccggccgc tcctggacaa gccccgaaag gcgtcttctt ccctggcggg 120 

agccgcgtgc gccccgcttt tcgcgctgct gtcccggggc cgccgcaggc ggatgcacgt 180 

cctcaggcga cgctgggacc tgggctccct ctgccgggcc ctgctcactc ggggcctggc 240 

cgccctgggc cactcgctga agcacgtgct cggtgcgatc ttctccaaga ttttcggccc 300 

catggccagc gtcgggaaca tggatgagaa atccaacaag ctgctgctag ctttggtgat 360 

gctcttccta tttgccgtga tcgtcctcca atacgtgtgc cccggcacag aatgccagct 420 

cctccgcctg caggcgttca gctccccggt gccggacccg taccgctcgg aggatgagag 480 

ctccgccagg ttcgtgcccc gctacaattt cacccgcggc gacctcctgc gcaaggtaga 540 

cttcgacatc aagggcgatg acctgatcgt gttcctgcac atccagaaga ccgggggcac 600 

cactttcggc cgccacttgg tgcgbeiacat ccagcbggag cagccgtgcg agtgccgcgt 660 

gggtcagaag aaatgcactt gccaccggcc gggtaagcgg gaaacctggc tcttctccag 720 

gttctccacg ggctggagct gcgggttgca cgccgactgg accgagctca ccagctgtgt 780 

gccctccgtg gtggacggca agcgcgacgc caggctgaga ccgtccagga acttccacta 840 

catcaccatc ctccgagacc cagtgtcccg gtacttgagt gagtggaggc atgtccagag 900 

aggggcaaca tggaaagcat ccctgcatgt ctgcgatgga aggcctccaa cctccgaaga 960 

gctgcccagc tgctacactg gcgatgactg gtctggctgc cccctcaaag agtttatgga 1020 

ctgtccctac aatctagcca acaaccgcca ggtgcgcatg ctctccgacc tgaccctggt 1080 

aggcbgctac aacckctctg tcatgcctga aaagcaaaga aacaaggtcc ttctggaaag 1140 

tgccaagtca aatctgaagc acatggcgtt cttcggcctc actgagtttc agcggaagac 1200 

ccaatatctg tttgagaaaa ccttcaacat gaactttatt tcgccattta cccagtataa 1260 

baccactagg gcctctagtg tagagatcaa tgaggaaatt caaaagcgta ttgagcfgacfc 1320 
gaattttctg gatatggagt tgtacagcta tgccaaagac ctttttttgc agaggtatca 1380 

gtttatgagg cagaaagagc atcaggaggc caggcgaaag cgtcaggaac aacgcciaatt 1440 
tctgaaggga aggctccttc agacccattt ccsagagccag ggtcagggcc agagccagaa 1500 

tccgaatcag aatcagagtc agaacccacia tccgaatgcc aatcagaacc tgactcagaa 1560 

tctgatgcag aatctgachc agagtttgag ccagaaggag aaccgggaaa gcccgaagca 1620 

gaactcaggc aaggagcaga atgataacac cagcaatggc accaacgact acataggcag 1680 

tgtagagaaa tggcgttaaa tggctcaaaa aggcctgtac atacttctcc caaagcgcca 1740 

ctgaaaagat ggcatagctt ciaaagatgaa aghgtccaaa cacatcctgc ttccttcatt 1800 

ggggaagttt taaaaaaaag tttagatgtt gcctttacag ttgcctttca attcagtgtt 1860 
atactgtgtg taggtaaaac aaatctcaat atggaattaa attgtctttt tggggttgga 1920 
ctaaatatga aatccgaaag ccaaaccaga ctcaccagaa attgctgttt agatatttta 1980 

agaagttctt aaattagtta tggagacaaa gtgaaaacat aaaatgtgac catttaactt 2040 

atggctaaga aatggacttt aaattattca tgatacactg ttaaaaccca atcttggaat 2100 

caaatatttt ttccaggggt gagaataagt ataaacataa agcaactaaa atgaaacata 2160 

aaacctttta ttttcttctg attttaacaa ggaatctatt taaatagaat aacaactgat 2220 

ggtgaatctt accgagctgt agaaaataaa aaattcctct ccaaacatgg gtagttttat 2280 
gtcaaaatat tggcttttca agaacaggac tcatatcttg atatttaaga gatgtttaaa 2340 

attttaaact ttttctacct tctactgttt aaaggtttta cacagggtgt atctcacatt 2400 

aaacaaaaca ccbtbtttbc Eiaaabgaaat accaatgtcia agatctaatb tccaggcgct: 2460 

ttcagggcac tgtciatttca acaatactgg aatcattttg gcgctgcttc tcattcattt 2520 

taaggcttct ctgaattgtg ctcattccaa attaacccat gtatagaatc tttcttcatc 2580 

atctaaatgg tgtgttgcbg aagbbabbgb ggtababaat ccbggabbaa agbcaggacb 2640 

bb 2642 

<210> 31 
<211> 2080 
<212> UNA 

<213> Hcsno sapiens " 
<220> 

<221> iidsc_feabure 

<223> Incybe ID No: 3538709CB1 

<400> 31 

bgbbtcggcg gccgcgggab gccccbgcgc bgaccgccag gggcaggbgc ccgcccgcgt 60 

agacgcaccc ggccbgaccc cgcgccacca bgbaaacagc gccagcaggc ggacgcbggc 120 

bbcbccgccb gggaccccbc cgccccgacc cgggccccgc ggcccbcgat gaggacacac 180 
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catgctgacc ggggtgaccg acggtatctt ctgttgcctg cfcgggcacgc cccccaacgc 240 
cgtggggcca ctggagagcg tcgagtccag cgatggctac acctttgtag aggtcaagcc 300 
cggccgcgtg ctgcgggtga agcatgcagg acccgcccca gccgctgccc cacctccacc 360 
atcatccgca tcctcggatg cagcccaggg ggacctctcc ggcttggtcc gctgtcagcg 420 
ccggatcacc gtgtaccgca atgggcggtt gctggtggaa aacctgggcc gagcccctcg 480 
agccgacctc ctacacgggc agaatggctc tggggagccg ccggccgccc tggaggtgga 540 
gctggcagat ecggcgggca gcgatggccg cttggccccc ggcagcgcag gcagcggcag 600 
cggcagtggc agtggtgggc ggcggcggcg agccaggcgc cccaagagga ccatccatat 660 
tgactgtgag aagcgcatca ctagctgcaa aggcgcccag gccgacgtgg tgctcttttt 720 
catccatggt gtcggcggtt ccctggccat ctggaaggag cagctggact tctttgtgcg 780 
cctaggctat gaggtggtgg ctcctgacct ggccggccac ggggccagct ctgcgcccca 840 
ggtggccgca gcctacacct tctatgcgct ggctgaggac abgcgagcaa tcttcaagcg 900 
ctatgccaag aagcgaaatg tgctcattgg ccattcctac ggtgtctctt tctgcacatt 960 
cctggcacat gagtacccag acctagtgca caaggtgatc atgatcaatg gcgggggccc 1020 
tacggcgctg gagcccagct tctgctcaat cttcaacatg cccacctgcg tcctgcactg 1080 
cttgtcgccc tgcctggcct ggagcttcct caaggccggc ttcgcccgcc aaggagccaa 1140 
ggagaagcag ctgttaaagg agggcaacgc tttcaacgtg tcatccttcg tactccgggc 1200 
catgatgagc ggccagtact ggcccgaggg cgacgaggtc taccacigGcg agctcaccgt 1260 
gcccgtcctg cttgtccacg gcatgcacga taagtttgtg ccggtggagg aagaccagcg 1320 
catggccgag atcctgctcc tggcattcct gaagctcatc gacgagggca gccacatggt 1360 
gatgctggaa tgccctgaga cggtcaacac gctgctccac gaattcctgc tctgggagcc 1440 
cgagccctcg cccaaggctc taccggagcc actgccggcg cc tccagaag acaagaagta 1500 
gccgctgggc cggcggggca tcgcttggtg agcacagccg cagcaggagg aggcccgagc 1560 
ctgcgccagg tctgcagcgc agaccacctg ggcgggccgt tcgctccggt gggcggggcc 1620 
aggtcaggga gacgccccca ggctgcctgg gcggggcgtg gcatccgagg gagcccagcg 1680 
gacattccgc tctccgcttc cgtcccgcgg ggcccatcgg cgttttgggg ccgcagccgg 1740 
gaccctcacg gaagatgacc ttgtacagaa gctctccctc accttccccc caacgccacg 1800 
gccaaggcag gccccccacc ccgctgtctt ccgtgtcagc cgtgcttgat cctgggaccc 1860 
acgagcccca cagggaccct cgaggcccca tcccgttatc cgagaccctt cctacccccc 1920 
attcctcggc gctgggagct atttttgcoc aagggggggg gatggggggg ctggcgccac 1980 
cgaacctgca catctcaact tgtaactcaa taaacagaag tgacaatcgg aaaaaaaaaa 2040 
aaaaaaaaaa aaaaAffiaaaa aaaaaaaaag aaaaaaaeteia- 2080 

<210> 32 . ■ 
<211> 2219 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_featiire 

<223> Incyte ID No: 71563101CB1 

<400> 32 

ggcctcceiat ctgcttccat gggggttggc tttctgagtg ggagaaatga ctctaatctg 60 
gagacatttg ctgagaccct tgtgcctggt cacttecgct cccaggatcc ttgagatgca 120 
tcctttcctg agcctaggta cttcccggac atcagtiaacc aagctcagtc ttcatacaaa 180 
gcccagaatg cctccatgtg acttcatgcc tgaaagatac cagtcccttg gctacaaccg 240 
tgtcctggaa atccacaagg eacatctttc tcctgtggtg acggcatatt tccagaaacc 300 
cctgctgctc caccaggggc aeatggagtg gctctttgat gctgaaggaa acagatacct 360 
ggatttcttt tccgggattg ttactgtcag tgttggccac tgccacccaa aggtgaatgc 420 
aigtggcacaa aagcagctcg gccgcctghg gcatacaagc accgtcttct tccaccctcc 480 
aatgcatgaa tatgcagaga agcttgccgc acttcttcct gagcctctta aggtcatttt 540 
cttggtgaac agtggctcag aagccaatga gctggccatg ctgatggcca gggcgcactc 600 
aaacaacata gacatcattt ctttcagagg agccfcaccat ggatgcagtc cttacacact 660 
tggcttgaca aacgtaggga tctacaagat ggaactccct ggtgggacag gttgccaacc 720 
aacaatgtgt ccagatgttt ttcgtggccc ttggggagga agecactgtc gagattctcc 780 
agtgcaaaca atcaggaagt gcagctgtgc accagactgc tgccaagcta aagatcagta 840 
tattgagcaa ttcaaagata cgctgagcac atctgtggcc aagtcaattg ctggattttt 900 
cgcagaacct attcaaggtg tgaatggagt tgtccagtac ccaaaggggt ttctaaagga 960 
agcctttgag ctggtgcgag caaggggagg cgtgtgcatt gcagatgaag tgcagacagg 1020 
atttggaagg ttgggctctc acttctgggg cttccaaacc cacgatgtcc tgcctgacat 1080 
tgtcaccatg gctaaaggga ttgggaatgg ctttcccatg gcagcagtca taaccactcc 1140 
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agagattgcc aaatctttgg cgaaatgcct gcagcacttc aacacctttg gagggaaccc 1200 
catggcctgt gccattggat ctgctgtgct tgaggtgatt aaagaagaaa atctacagga 1260 
aaacagtc^ gaagttggga cctacatgtt actaaagttt gctaagctgc gggatgaatt 1320 
tgaaattgtt ggagacgtcc gaggcaaagg cctcatgata ggcatagaaa tggtgcagga 1380 
taagataagc tgtcggcctc ttccccgtga agaagtaaat cagatccatg aggactgcaa 1440 
gcacatggga ctcctcgttg gcagaggcag cattttttct cagacatttc gcattgcgcc 1500 
ctcaatgtgc atcactaaac cagaagttga ttttgcagta gaagtatttc gttctgcctt 1560 
aacccaacac atggaaagaia gagctaagta acattgtcag aaataaataa aaccacaagt 1620 
ctcaagaatt tgccacgtat gttcaagggt gaatttgaag aatttcagaa ccactggtafc 1680 
ccagagaaag cctgcagctc tccacaggag ctgteiaaagt catggttgac tgcctaccaa 1740 
ccatatttgt tagcagagcc cctcttatct tgagaactcc attcttcagg gaaaggatct 1800 
ceetagctca gagaataaat cctaattagt ttatgttagg tatggtaatt tgattcccct 1860 
ttgcagtgat tggtttatgc atgaatatgt gatgtatttt tgtccagtga atcttgaaga 1920 
oaaatctttt ggtggaggtg ccttcsaggga aagttttctt caccctcact cttcagttca 1980 
agaagagatg tcttcttgtt gcgctgagaa caccatatgt tcatgacgag attcctggca 2040 
ccatgtcagc cggcttgtag tcatgaggac aacccttttt ggtgaggttg gaagatggat 2100 
ggaagccaag tgcttagtga tgtceiaagaa gcactcactt aagcattcct ggagccaccc 2160 
tacctcaggg cctcttgata tttgaggfcM taaaattcat tgttctgtat nanannnna 2219 

<210> 33 

<211> 681 

<212> ENA 

<213> Hcamo sapiens 

<220> 

<221> misc^featiire 

<223> Incyte ID No: 7472027CB1 

<400> 33 

atgagactat gtgaaaagac cgaactacag ttgattggtg tacctgaaag tgacagggag 60 
aatggaacca agttggaaaa cacatttcag gatattatcc aggagaactt ccccaaccta 120 
gcaagacagg ccaacattca aattcagatg gctggtggat ctatctggat tgaagggatt 180- 
ccttttccca gcaataattt tacagacctg agacgtttgc oagatgaaat tgtgttgcgg 240 
gatgaagatg tcattacact ttcttaccca aagtcaggaa gcttttggat agtggaaatc 300 
atcagtctga tccactccaa gggagatcct agttgggtcc siatctgttgt tccctgggat 360 
cgttcaccat ggatagaagt taaacgtaag aaagcaggtt tagagagtca gaagggccca 420 
cacctctaca cctcccacct tcccattcag ctcttcccca agtcattctt gaattccaag 480 
gccaagtgta tttatcctca tgttctcatg cttgtggttc tcatcctagg acataagagc 540 
cagtggagta ttgctataaa gatatctgaa aatgcagaag caacttcaaa actgggtaat 600 
gggcaaagat tggaagagtt tggagggctc agaagaagac aggaagatga aagaagtttg 660 
gaatttctta gagactgtta a 

<210> 34 
<211> 399 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> iBisc_feature 

<223> Inc^e ID Ho: 7480358CB1 

<400> 34 

agcccagacc agagtgtggc ccctggagtc ataatgatga aggtcatgta catgttgaag 60 
ggccagagcc cggtgcaggg caccatccac tttgagcaga aggaaaatga acciatttatg 120 
gtgtcagaat gcattacagg attgactgaa cgccagcaca gattccatgt tcatcagttt 180 
ggagataata caccaggctg taccagggca gttccttact ttaatccttt aaccaaaaac 240 
cacagtgggc caaggatcaa gagaggcagg ttggagacct gggtaatgtg gccgctggca 300 
aagatgtgtc gccacatgtc tgttgaagat tctctggtct cactctcagg acaetattcc 360 
atcactgccc acacaatggt gtccatgaca accagatga 399 

<210> 35 
<211> 2302 
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<212> DMA 

<213> Homo sapieni? 

<220> 

<221> miscfeature 

<223> liicyte ID No: 1618256CB1 

<400> 35 

taactaatcg gaaggcgetg tgaatttcac gtttctcgcg tagctgtaac gcgcggtata 60 
gttcttactc aagtcgtgcc ctgccagaaa ggggaagatg cagcgaggga gagacctaga 120 
accgcgatca aagctgcagc agctgctggg gctggcggca caaagggagg agggaggagc 180 
ctgggcgctg agaaagttct tggggaaagt tgagctgagc caagagtcgg gcggtggctg 240 
ggatgggcgg gagggcccgg cccgatttcc cttgctgctc cccttgtggc ctgacgctga 300 
cagaggcaaa aatctgctaa ctcagggggc agactcaacc aagactgtga gcaggcctgg 360 
ggaatgaccc cccgatctcc aaccagtgcc ttccgcagct gcacggctgt ctccagctgt 420 
ctctgcccct cttcctggcc ctggctccat ctctctgtca cctcaccctt ccctgtgcca 480 ? 
catgggccct ctctctcctg ccaggacgct gcggctctgg ggacctcgga gcctgggggt 540 
ggctctggga gtcttcatga ccattggctt tgcactccag ctcttgggag ggcccttcca 600 
gaggaggcta cctgggctac agctccgaca gccctcggcc ccatccctac gaccagccct 660 
tccgtcctgc ccaccccgrgc agcgactggt gttcctgaag acacataaat ccgggagcag 720 
ctctgtgctg agcctgcttc accgctatgg ggaecagcac gggctgcgct tcgccctcec 780 
tgcccgctac cagtttggct acccaaagct cttccaggcc tctagggtaa aaggctaccg 840 
cccacagggrt ggaggcaccc agctcccctt ccacatcctc tgtcaccaca tgaggttcaa 900 
cctgaaagag gtacttcagg tcatgccttc tgacagcttc ttttttteca ttgtccgaga 960 
cccagcggct ctggctcgct ctgcettctc ctactataaa tccacctcat cagcctbccg 1020 
caagtcacca tctttggctg ccttcctggc caatcctcga ggcttctaca ggcctggggc 1080 
ccgtggggac cactacgctc gcaacttact atggtttgac tttggcctgc cctttccccc 1140 
agagaagagg gccaagagag ggaatattca tccccccaga gaccccaacc ccccacagct 1200' 
gcaggtcttg ccttctggtg ctggccctcg agcccaaacc ctcaatccca atgccctcat 1260 
ccatcctgtt tccactgtta ctgatcafccg. cagccagata tcaagccctg cctctttcga 1320 
tttggggtct tcatccttca tccagtgggg tctggcctgg ctggactctg tctttgacct 1380 
ggtcatggtg gctgagtact tcgatgagtc attggttctg ctggcagatg ccctgtgctg 1440 
gggtctagat gacgtggtgg gcttcatgca caatgcccag gctggacata agcagggcct 1500 
cagcactgtc agcaacagtg gactgactgc ggaggaccgg cagctgactg cacgggcccg 1560 
agcctggaac aacctggact gggctctcta tgtccacttc aaccgcagtc tctgggcacg 1620 
gatagagaaa tacggccagg gccggctgca gacagctgtg gccgagctcc gggctcgccg 1680 
agaggcccta gcgaaacatt gtctggtagg gggtgaggct tctgacccca aatacatcac 1740 
tgatcgccgg ttccgcccct tccagtttgg gtcagctaag gttttgggct atatacttcg 1800 
gagtggattg agcccccaag accaagagga afcgtgagcgc ctagctaccc ctgagctcca 186^0 
gtacaaggac aagctggatg tcaagcagtt cccccctacc gtctcactgc ccctcaagac 1920 
ttcaaggcca ctctccccat aaacatcaga ctacagattt aggtggaaga gcagccatgt 1980 
ttgaagggca catgtgatga gtggggggca gcaagatgcc atttctgcat ctcccagaag 2040 
ggatgagtct ttgtcccgat gcaagccccc tcttcgctgg gctcccagca gtgcttccct 2100 
cctccaccct ccactcattt tgttctttcc ccccaacttt tttttttttg aaacggagtc 2160 
ttgctctgtc ccccaggctg gagtgcagtg gcatgatctc ggctcactgc aacctctgcc 2220 
tcccaggttc aagcgattct cctgcctcag cctccagagt agctaggatt acagatacgt 2280 
gccaccatac ccggctaatt tt 2302 

<210> 36 

<211> 3341 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3387823CB1 

<400> 36 

gaaacgaaga gaaccctctg tccctggcag aatctgcatg tacatttctt gtctgtcctt 60 
gtctctcttc ttcctgtctg gcccattgca gagagtattg gaagtttcca accattggtg 120 
gtactctatg ctcatcctac ctcctttgct gzteiagacagt gtcrgcagcgc ccctgctgtc 180 
tgcctactac cctgactgtg ttggcatgag cccctcctgc accagcacaa accgcgccgc 240 
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tgccactggc aatgccagcc ctgggaagct ggagcactcc aaggctgccc tctccgtgca 300 
cgttccaggc atgaaccgat acttccagcc tttctaccag cccaatgagt gtggcaaagc 360 
cctctgtgtg aggccggatg tgatggaact ggatgagctc tatgagtttc cagagtattc 420 
ccgagacccc accatgtacc tggctttgag aaacctcatc ctcgcactgt ggtatacteia 480 
ctgcaaagaa gctcttactc ctcagaaatg tattcctcac atcatcgtcc ggggtctcgt 540 
gcgtattcga tgcgttcagg aagtggagag aatactgtat tttatgacca gaaaaggtct 600 
catcaacact ggagttctca gcgtgggagc cgaccagtat cttctcccta aggactacca 660 
caataaatca gtcatcatta tcggggctgg tccagcagga ttagcagctg ctaggcaact 720 
gcataacttt ggaattaagg tgactgtcct ggaagccaaa gacagaattg gaggccgagt 780 
ctgggatgat aaatctttta aaggcgtcac agtgggaaga ggagctcaga ttgtcaatgg 840 
gtgtattaac aacccagtag cattaatgtg tgaacaactt ggcatcagca tgcataaatt 900 
tggagaaaga tgtgacttaa ttcaggciagg tggaagaata actgacccca ctattgacaa 960 
gcgcatggat tttcatttta atgctctctt ggatgttgtc tctgagtgga gaaaggataa 1020 
gactcagctc caagatgtcc ctttaggaga aaagatagaa gaaatctaca aggcatttat 1080 
taaggaatct ggtatcceiat tcagtgagct ggagggacag gtgcttcagt tccatctcag 1140 
taacctggag tacgcctgtg gcagcaaccfc tcaccaggta tctgctcgct cgtgggacca 1200 
caatgaattc tttgcccagt ttgctggtga ccacactctg ctaactcccg ggtactcggt 1260 
gataattgaa aaactggcag aagggcttga cattcaactc aaatctccag tgcagtgtat 1320 
tgattattct ggagatgaag tgcaggttac cactacagat ggcacagggt attctgcaca 1380 
aaaggtatta gtcactgtac cactggcttt actacagaaa ggtgccattc agtttaatcc 1440 
accgttgtca gagaagaaga tgaaggctat caacagctta ggcgcaggca tcattgaaaa 1500 
gattgccttg caatttccgt atagattttg ggacagtaaa gtacaagggg ctgacttttt 1560 
tggtcacgtt cctcccagtg ccagcaagcg agggcttttt gccgtgttct atgacatgga 1620 
tccccagaag aagcacagcg tgctgatgtc tgtgattgcc ggggaggctg tcgcatccgt 1680 
gaggaccctg gacgacaaac aggtgctgca gcagtgcabg gccacgctcc gggagctgtt 1740 
caaggagcag gaggtcccag atcccacaaa gtattttgtc actcggtgga gcacagaccc: 1800 
atggatccag atggcataca gttttgtgaa gacaggtgga agtggggagg cctacgatat 18 60 
cattgctgaa gacattcaag gaaccgtctt tttcgctggt gaggcaacaa acaggcattt 1920 
cccacaeiact gttacagggg catatttgag tggcgttcga gaagcciagca agattgcagc 1980 
attttaagaa ttcggtggac ccagctttct tctgtacccc agatggggaa atttgaatca 2040 
catgttaaac ctcagtttta taagaggggg aaaaaaccgt cfcctacatag taaaactgaa 2100 
atgtttctaa ggcgatatga taatgcaaac ctatttcatc actctaaaag cactgacctc 2160 
aaaaaacctt ataagcactt agatttaatt gcattttcca taggttcaac tactgctgaa 2220 
agtctggatt tcagaataaa gcagaatgta agtttcagtt gaggccatgg atttgattgt 2280 
tccatggctg gaagttccct ttagatttca cattttatat ggctgatcaa ttttcataca 2340 
ttgagaaacc aagtcaatca agcaggaatc atttaaaaac cagataaagc catgtttttc 2400 
ttctgtgaca atttatcagt atctttacca atgagcctta atttttatat aggtccaata 2460 
ttgagctttt acttaaaatt tagatagaac ttttttttgg atacagcaca aiactccagtt 2520 
gacagtaaaa tgaagcttct aggtattttg tattgtacat atttcctcct actgggtgtt 2580 
caaaagaaat ttaaattcaa gtaccttttg tgataaaatg ttttagattt gtgcacccat 2640 
tggcaaaaca ggaaagtttc cagataggta ttgtatcatt gagaatgcag cacagatagt 2700 
gtgggcttc!a cactatagac acagaatata gctttttctt aaagccaciat ttgggtgata 2760 
ggacacttta aatatcctta attttggcaa ccactagcaa aaaaaacttg tcagaataat 2820 
ttaaccaagc ccctctccac ttcttttatt taaaagcact gattcaattg ctaggaatat 2880 
ttttgcagat ttttctttac agtattccat aggcaggtcc actggaaaac tgcagaaaaa 2940 
tgtgagctct cctggtaaat agtatacatt ttataagcta tattttaaag gcctaagaac 3000 
atggcgagta tttactttta tctttttttt aaaaacactc atgacagaaa acagtctaat 3060 
aatatctcat tctaaaataa aacactggtt gcagggtctt caggatgcct attttgccag 3120 
aaacttcagt atacaggtta gaaatatgct tttgtttttg aacataatat actggtttgc 3180 
tttaaagaag ggactaaata tgactttaaa gagacttcaa atattgagta tttta a aaat 3240 
ktaaaagtag gbcagtttat aacgagtaaa tacctaacac ccaagaatgb gcagtgaacc 3300 
tcaggcgggg atccttagtt ctaacggccg ccccgtaggc g 3341 

<210> 37 

<211> 777 

<212> DNA 

<213> HooDO sapiens 

<220> 

<221> misc_featiire 

<223> Incyte ID No: 55142051CB1 
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<400> 37 

atgtcccctg ccattgcatt ggccttcctg ccactggtgg taacattgct ggtgcggtac i50 
cggcactact tccgattgct ggtgcgcacg gtcttgctgc geiagcctccg agactgcctg 120 
tcagggctgc ggatcgagga gcgggccttc agctacgtgc tcacccatgc cctgcccggt 180 
gaccctggtc acatcctcac caccctggac cactggagca gccgctgcga gtacttgagc 240 
cacatggggc ctgtcaaagg tcagatcctg atgcggctgg tggaggagaa ggcccctgct 300 
tgtgtgctgg aattgggaac ctactgtgga tactctaccc tgcttattgc ccgagccctg 360 
ccccctgggg gtcgccttct tactgtggag cgggacccac gcacggcagc agtggctgaa 420 
aaactcatcc gcctggccgg ctttgatgag cacatggtgg agctcatcgt gggcagctca 480 
gaggacgtga tcccgtgcct acgcacccag tat cage tga gtcgggcaga cctggtgctc 540 . 
ctggcacacc ggccacgatg ttacctgagg gacctgcagc tgctggaggc ccatgcccta 600 
ctgccagcag gtgccaccgt gctggctgac catgtgctct tccctggtgc accccgcttc 660 
ttgcagtatg ctaagagctg tggccgctac cgctgccgcc tccaccacac tggccttcca 720 
gacttccctg ccatcaagga tggaatagct cagctcacct atgctggacc aggctga 777 

<210> 38 

<211> 3600 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 7395274CB3. 

<400> 38 

gagagcagag caggacactg gcgccgcggg tcaggcagct gcgtgcgcgt ctcctccagg 60 
oagcaagggg aacccgaggc cgccggcgcc cggaccatgt cgtctccggg gccgtcgcag 120 
ccgccggccg aggacccgcc ctggcccgcg cgcctcctgc gtgcgcctct ggggctgctg 180 
cggctggacc ccagcggggg cgcgctgctg ctatgcggcc tcgtagcgct gctgggctgg 240- 
agctggctgc ggaggcgccg ggcgcggggc atcccgcccg ggcccacgcc ctggcctctg 300 
gtgggcaact tcggtcacgt gctgctgcct cccttcctcc ggcggcggag ctggctgagc 360 
agcaggacca gggccgcagg gattgatccc tcggtcatag gcccgcaggt gctcctggct 420 
cacctagccc gcgtgtacgg cagcatcttc agcttcttta tcggccacta cctggtggtg 480 
gtcctcagcg acttccacag cgtgcgcgag gcgctggtgc agcaggccga ggtcttcagc 540 . 
gaccgcccgc gggtgccgct catctccatc gtgaccaagg agaagggggt tgtgtttgca 600 
cattatggtc ccgtctggag acaacaaagg aagttctctc attcaactct tcgtcatttt 660 
gggttgggaa aacttagctt ggagcccaag attattgagg agttcaaata tgtgaaagca 720 
gaaatgcaaa agcacggaga agaccccttc tgccctttct ccatcatcag caatgccgtc 780 
tctaacatca tttgctcctt gtgcthtggc cagcgctttg attacactaa tagtgagttc 840 
aagaaaatgc ttggttttat gtcacgaggc ctagaaatct gtctgaacag tcaagtcctc 900 
ctggtcaaea tatgcccttg gctttattac cttccctttg gaccatttaa ggaattaaga 960 • 
caaattgaaa aggatataac cagtttcctt aaaaaaatca tcaaagacca tcaagagtct 1020 
ctggatagag agaaccctca ggactfccata gacatgtacc ttctccacat ggaagaggag 1080 
aggaaaciata atagtaacag cagttttgat gaagagtact tattttatat cattggggat 1140 
ctctttattg ctgggactga taccacaact aactctttgc tctggtgcct gctgtatatg 1200 
tcgctgaacc ccgatgtaca agaaaaggtt catgaagaaa ttgaaagagt cattggcgcc 1260 
cLaccgagctc cttccctcac agacaaggcc cagatgccct acacagaagc caccatcatg 1320 
gaagtgcaga ggctaactgt ggtggtgccg cttgccattc ctcatatgac ctcagagaac 1380 
eu^agtgctcx: aagggtatac cattcctaaa ggcacattga tcttacccaa cctgtggtca 1440 
gtacatagag acccagccat ttgggagaaa ccggaggatt tctaccctaa tcgatttctg 1500 
gatgaccaag gacaact£iat taaaaaagaa acctttattc cttttgggat agggaagcgg 1560 
gtgtgtatgg gagaacaact ggcaaagatg gaattattcc taatgtttgt gagcctaatg 1620 
cagagtttcg catttgcttt acctgaggat tctaagaagc ccctcctgac tggaagattt 1680 
ggtctaactt tagccccaca tccatttaat ataactattt caaggagatg aagagcatct 1740 
ccaagaagag atggtaaaaa gatatataaa tacatatcct tcbaagcaga ttcttcctac ISOO* 
tgcaaaggac agtgaatcca gcaactcagt ggatccaagc tgggctcaga ggtcggaagg 1860 
agggtagagc acactgggag gtttcatctt ggaggattcc tcagcaggat acttcagcca 1920 
ttttagtaat gcaggtctgt gatttggggg atagaaaaca aagtacctat gaaacgggat 1980 
atctggattt tacttgcagt ggcttccacc gatgggccaa tcttctcatt tcttagtgcc 2040 
tcagacat cc catatgtaaa atgagagtaa taaaacttgg cttctctcta cctctcagca 2100 
ctaatgatgg tcaaatgcct tacatctttt ctgatatctc taaaatgctg ttaagttctg 2160 
gagaagaact tcaggagaag aagatctatc agctggcttt taaagaccta tgacaacatg 2220 
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aaagtggtgt tcagcctgga atgctttgtc agagatgggt gtggatttag gttatactgg 2280 
gggagaactt ttctcagcac agattctatg ccagcttctt tgggcttgtt ctgtcactat 2340 
ctttttgttt atgattttag tttttacttt ttgtagatgt gggatgaagt ggactctgtc 2400 
gtgtatattg aggaaaaaag aaattataat tttaaaeiaat cccfctgtagg attattatct 2460 
aaatttafcat gtctaacttc tactacaact acaggaacag tgagccttgc tacttcttta 2520 
gtagcttctt ggcagaattc ctttctactg agttatttgc aaagatgcag ctctaccttt 2580 
ttacttaagg cctgaatggt gagcatgggg attttgatac tgggactcat caggaaagga 2640 
ttctgctttc aaactatact gaacattcct gtcctagcgt ccctgccacc aggcccaatg 2700 
catctgatcc ttgaatatac tctcaaagaa ttcactctct ttttattaag agaactaaat 2760 
tgtttctaaa tgtagatggt ccctctggaa aagcagtttt cagcaggggt ggtaacccct 2820 
tcagagggag tttggaaatg tgtgggtatg attcttggtt atcataatga tgggggtgct 2880 
actggccttc tgctgccatg ggaccaggat gctaaatgtc aaggtagtcc tatacagtga 2940 
agaattgtcc tgctcaagat gccaggattt cccccagtga gaacatgctc taaggaatga 3000 
ccaccccttt cttttattct cccacagtgc tccatgtaca gaagtaagca tagcagtcat 3060 
atgagcaacc acattcctga acctttcctc atgctggctc tacacttaat cctttacttg 3120 
tatgtttctg taattcttac ataaattcta ttaagagggt ggcatactgt agtggatgaa 3180 
gctgaggctt atagtaggta aggcaceuuig ttaaaaagta acatcactgg gtttcaaacc 3240 
tactggtctc tgtgactaaa gaacactttc agaaccactt cttgattctg ccaccacttg 3300 
atcccataac aggctacccc ttggcctcat gctggagttg tgtgtgtctg tcttcatccc 3360 
aggctgagct ccttgaggtg aggatgttgt gctgtttgcc tccctcacag tgccttggtc 3420 
ttagtggatg cccagttgtc ttgtgaatga cttttaagaa gtgtacttaa gagaaaaatc 3480 
ctaccttatt tgciataatta caagtcatgt ttttgttgct taaaggtgat aaatcagtgt 3540 
atattatttg ttaatgtcea ttaaagccag tttttaeiaaa aaaaaaaaaa aaaaaaaaaa 3600 



36/36 



